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ABSTRACT 


\ 


in  estimating  the  Circular  Probable  Error^m 


This  thesis  presents' a  general  discussionjof  the  problems  involved 

-referred  fcu  gg1 

A  comparison  is  made  between  the  estimates  of  the  CEP  under 
tTO  distinct  models.  The  models  are  identical  except  for  the  location 
of  the  mean  vector  in  relation  to  the  target.  The  assumption  of  depen¬ 
dence  is  made  in  both  models  and  the  resulting  estimates  are  compared 
with  the  corresponding  estimates  obtained  under  the  assumption  of  inde¬ 
pendence,  Confidence  interval ’estimates  of  the  CEP  are  also  presented. 

Two  methods  of  removing  outlier  or  *ilaverick*  observations  are  intro¬ 
duced  and  some  of  the  possible  effects  on  the  estimated  CEP  are  discussed. 
The  different  estimating  procedures  are  illustrated  with  three  numerical 
problems. 


ii 


The  ter. a  "C-P*  is  familiar  to  most  haval  Officers,  but  the  underlying 
assumptions  upon  which  this  measure  of  effectiveness  vests  are  often 
misunderstood.  Therefore,  it  is  the  objective  of  this  thesis  to  explain 
as  fully  as  possible  what  the  CTF  is  and  to  illustrate  some  of  the 
methods  available  to  estimate  the  CTP. 

The  C3P  was  initially  developed  in  order  to  give  some  criterion  for 
measuring  the  expected  effectiveness  of  a  particular  weapon  system  and  to 
give  some  means  for  comparing  similar  weapon  systems  or  weapons.  In 
order  to  develop  this  criterion,  it  is  essential  that  the  assumptions 
used  are  well  understood  and  established.  The  approach  most  often  used 
is  to  assume  that  the  errors  in  and  across  the  line  of  sight  are  inde¬ 
pendent  and  that  the  variances  are  equal  with  the  justification  that 
these  assumptions  produce  a  negligible  error.  However,  an  error  may  be 
introduced  and  it  is  necessary  to  at  least  understand  what  is  being 
assumed  before  making  judgement  on  the  legality  of  any  assumption.  This 
thesis  therefore,  attempts  to  explain  such  assumptions  and  to  oompara 
possible  results  of  making  certain  assumptions  in  three  example  problems. 
The  problems  are  all  ficticious  and  utilized  only  for  the  purpose  of 
explaining  the  estimating  procedures  and  assumptions. 

The  thesis  is  primarily  directed  at  the  reader  with  a  college  back¬ 
ground  in  calculus,  some  matrix  theory,  and  some  feel  for  basic  proba¬ 
bility  and  statistical  procedures.  The  contents  are  arranged  in  six 
sections  and  throe  appendlcos.  Section  X  is  an  introduction  to  the  prob¬ 
lem  and  the  basie  mathematical  concepts  which  will  be  used.  Sections  II 


and  III  introduce  the  most  commonly  used  estimating  procedures*  Section 
IV  explains  the  problem  of  deleting  outlying  observations  from  the  deter¬ 
mination  of  the  estimate.  Section  V  introduces  the  confidence  interval* 


Section  VI  is  a  summary  of  the  techniques  used  in  the  previous  sections. 

Appendix  A  is  concerned  with  the  mathematical  techniques  which  are  used 

\ 

to  explain  and  transform  the  true  orientation  of  the  dependent  variables* 
Appendix  3  explains  two  methods  of  obtaining  unbiased  estimates  of  the 
CSP.  Appendix  C  explains  in  detail  the  methods  of  integrations  used. 

It  is  recommended  that  Appendices  A  and  C  be  studied  before  starting 
Section  II. 

This  thesis  was  written  during  the  period  January-June  1962  at  the 
United  States  Naval  Post  Graduate  School,  Monterey,  California.  I  wish 
to  express  my  gratitude  to  Professor  J*  R.  Borsting  for  his  continued 
patience,  encouragement,  and  most  competent  guidance  while  acting  as 
faculty  advisor,  and  to  Professor  Max  Woods  for  his  continuous  aid  and 
technical  understanding  of  the  problem  while  acting  as  second  reader* 

I  also  wish  to  thank  my  wife  for  the  moral,  clerical,  and  artistic 
assistance  given  me  during  the  writing  of  this  thesis  as  well  as  the 


past  two  years* 
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SECTION  I 


INTRODUCTION  TO  THE  PROBLEM 

1.1  General  Discussion  of  the  Circular  Probable  Error 

The  problem  of  determining  useful  estimates  of  the  parameters 
which  describe  the  distribution  of  the  fall  of  shot  about  a  target  is 
directly  related  to  the  high  cost  of  testing  expensive  weapon  systems. 
Since  relatively  few  tests  are  allowed  because  of  this  expense,  it  is 
not  improbable  that  a  good  weapon  system  could  be  completely  rejected 
because  of  inefficient  utilization  of  the  small  amount  of  data  available. 
Also,  the  size  and  yield  of  the  warhead  is  directly  related  to  the  esti- 
mated  parameters.  If  the  estimated  variance  is  large,  the  effective 
radius  will  also  have  to  be  large  to  cover  the  target  complex,  and  in 
turn  the  missile  will  not  be  able  to  reach  the  range  of  the  same  missile 
with  a  smaller  warhead.  The  most  efficient  use  of  the  limited  data  will 
thus  greatly  reduce  the  risk  involved  in  reducing  the  warhead  size  and 
increase  the  potential  range.  It  also  may  aid  in  weapon  deployment  or 
assignment  to  larger  targets  because  of  the  greater  confidence  that  can 
be  placed  in  the  estimates.  It  seems  logical  that  if  a  great  deal  of 
confidence  can  be  placed  in  the  weapon,  fewer  weapons  will  have  to  be 
assigned  to  a  target,  thus  releasing  some  weapons  for  other  targets. 

The  Important  point  is  that  the  confidence  placed  on  the  estimators 
must  be  high  enough  to  reduce  the  risk  involved  and  provide  a  sound 
basis  for  decision. 

One  method,  which  is  commonly  used,  to  measure  and  compare  the 
estimated  parameters,  is  called  the  circular  probable  error  or  CSP 
method.  The  CEP  is  defined  as  the  radius  of  the  circle  with  center 
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at  (ux,u^)  which  includes  507.  of  a  bivariate  probability  mass.  The 
illustration  in  figure  (1)  shows  the  form  of  this  function.  It  is  to 
be  noted  that  most  of  the  volume  under  the  curve  is  centered  at  the  tar¬ 
get  and  decreases  as  the  distance  increases  from  the  target.  This  par¬ 
ticular  function  is  well  founded  historically  on  the  basis  of  the 


Bivariate  Probability  Maas 
Figure  1 

The  bivariate  normal  distribution  is  a  generalisation  of  the  normal 
distribution  of  a  single  variate  and  is  bell  shaped  as  show  in  figure  (1) 
above*  Any  plane  parallel  to  the  x,y  plane  that  cuts  the  surface  will 
lnterseot  the  bell  in  the  elliptical  curve  show  in  figure  (2)* 
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Bivariate  Density  Function  which  has  been 
Out  by  a  Plane  Parallel  to  the  x,y  Plane* 

Figure  2 

Any  plane  perpendicular  to  the  x,y  plane  will  cut  the  surface  in 
a  curve  of  the  normal  form  aa  atom  in  figure  (3). 


Bivariate  Density  Function  which  has  been  Out 
by  a  Plane  Perpendicular  to  the  x»y  Plane* 


Figure  3 

The  bivariate  density  function  actually  represente  a  five  parameter 
family  of  distributions)  the  parameters  being  the  means  (u  ,u  ) *  the 

"  y 

variances  (J"\  and  the  correlation  coefficient  This  function  is 

*  y 

synaetrlo  about  the  means  and  has  its  greatest  value  at  the  point  (u  »u  )• 
Xt  should  also  be  noted  that  if  the  errors  in  the  x  end  y  directions  are 
independent  and  the  variances  equal)  then  the  distribution  will  be  in 
the  shape  of  a  bell  with  two  of  the  opposites  sides  "pushed  in"  an 
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Two  Bivariate  Density  Functions  with 

Different  Variances  about  (u  ,u  )j  Side  View 

x  y 

Figure  4 

If  the  variances  are  equal,  a  plane  cutting  the  surface,  as  in 
figure  (2),  will  Intersect  the  bell  in  a  circle. 

The  height  of  the  curve,  forming  the  density  function,  at  any 
point  "a"  is  related  to  the  probability  of  that  point.  Since  this  func¬ 
tion  is  continuous,  the  probability  must  be  expressed  in  the  form  of  sn 
interval  since  the  probability  of  any  single  point  is  zero.  However, 
the  probability  that  a  u..i  vu*iuuie  X,  in  the  distribution  being 
considered,  falls  in  an  interval  is  equal  to  the  area  under  the  curve 
in  the  interval  being  considered.  That  is,  the  probability  that 
a&X4b  is  equal  to  the  area  shown  under  the  curve  in  figure  (5), 

Hote  that  since  the  area  under  the  curve  about  the  point  (u  ,u  )  is  the 

X  y 

greatest,  the  probability  that  the  random  variable  X  fall  in  this  Interval 

is  greater  than  that  of  an  interval  of  equal  length  away  from  the  point 

(u  ,u  ),  nils  is  shown  in  figure  (6), 
x  y 
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a  *>  u. 

Univariate  Density  Function 
Showing  the  Area  Under  Con¬ 
sideration  When  Determining 
P(a<X<b). 


Figure  5 


*  U«  b  “ 

Univariate  Density  Function 
Showing  the  Areas  Under  Con¬ 
sideration  In  the  Intervals 
(a,b)  and  (c,d)  where 
b-a  ■  d-c. 

Figure  6 


1.2  Mathematical  Notation 

X  and  Y  are  said  to  have  a  bivariate  normal  distribution  if  their 
Joint  density  function,  is  givan  ^ 


(1«1)  ^  yC’»y) 


1.2.1 

The  quantity  x  is  said  to  be  an  observed  value  of  a  numerical 
valued  random  phenomenon  X  if  for  every  real  number  x  there  exists  a 
probability  that  X  is  less  than  or  equal  to  x.  In  this  problem  the 
observed  values  of  the  random  variables  X  and  Y  are  the  coordinates 
of  the  data  points  with  respect  to  the  target.  These  coordinates  can 
also  be  referred  to  as  miss  distances  in  and  across  the  line  of  sight. 
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1.2.2 


directions  respectively.  The  mean  of  a  probability  law  is  equivalent 
to  the  expected  value  of  the  random  variable  with  respect  to  the  proba¬ 
bility  law.  This  is  written  as: 

OO  oO 


(1.2)  ux 


(x,y)dxdy 


•o 


(1.3)  u 


s(y)»  r  C y  fXtY(X,y)dxdy 

_.o 


The  mean  value  cannot  be  determined  exactly  in  our  problem  even  if  all 
of  the  missiles  have  been  fired  but  estimates  of  the  mean  values  can 
be  determined  from  the  observations. 


1.2.3 

The  expressions  (x  -  u  )  and  (y  -  u  )  are  the  deviations  from  the 

X  / 

mean  values  in  the  x  and  y  directions  respectively. 

1.2.4 

(Jy  and  Vy  are  the  standard  deviations  in  the  x  and  y  directions 
respectively.  The  standard  deviation  is  defined  as  the  square  root  of 
the  variance  of  the  probability  law.  The  variance  /is  defined  as  the 
seoond  central  moment  of  the  probability  law  and  is  defined  byi 


(1.4)  Y*2  "  -  E(>0)2]  -  2[(X  -  ux)2]  -  E(X2)  -  ux2 

It  should  be  noted  that  the  mean  values  determine  the  location  (ux*uy) 
of  the  center  of  the  normal  density  function  and  the  standard  devia¬ 
tions  (^t  and  Vy )  determine  the  shape  of  the  function  about  the  mean 
in  the  x  and  y  directions  respectively. 
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1.2.5 


The  correlation  coefficient  of  two  jointly  distributed  random 
variables  X  and  Y  is  defined  byp*  CQV  (X.Y)  where 

(1.5)  GOV  (X.Y)  -  SCX  Y)  .  E(X)BCY) 

E(X  Y)  -  /  fxyf  (x.y)dxdy 
.00  *1* 

The  correlation  coefficient  provides  a  measure  of  how  good  a  predic¬ 
tion  can  be  formed  on  one  of  the  random  variables  on  the  basis  of  the 
observed  value  of  the  other  random  variable.  In  other  words,  if  the 
value  of  one  of  the  random  variables  la  given,  the  expected  value  of 
the  other  random  variable  can  be  determined.  This  may  be  written  as 
E(X|Y)  where  the  value  of  X  is  given*  That  is, 

o* 

(1.6)  E(X|Y)  -  £  x  fXJY<xty)dx  where  £x*Y(x,y^ 

is  the  conditional  density  funotlon  of  the  random  variable  X  given 
the  value  of  the  random  variable  Y.  The  conditional  density  function 
la  derived  from  the  conditional  probability  of  a  random  event  A. 
given  a 'random  variable  X.  This  notion  forma  the  baala  of  the  maths* 
metical  treatment  of  Jointly  distributed  random  variables  that  are 
not  independent.1 

In  the  particular  ease  where  two  random  variables  X  and  X  are 
jointly  normally  distributed,  the  conditional  expected  value  of  the 
random  variable  X  given  that  the  random  variable  X  is  some  particular 

*4* 

value  y.  is  a  linear  funotlon.  This. linear  function  is  related  to 
the  orientation  of  the  shape  of  the  density  function  as  shown  in 
Appendix  A. 

1  "Modern  Probability  Theory  and  Its  Applications"  by  Bnanuel  Parson 
/If  of  Stanford  University. 
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In  order  to  simplify  the  notation,  it  will  be  convenient  to 
represent  the  bivariate  density  function  in  matrix  notation.  The 
terms  in  formula  1,1  are  first  arranged  in  the  form 


v< 


I  .  CV~i  l‘AZ 

xirlA’'fi  * 


uhere.^ 


'"1. 


P<5  VP 


Using  this  notation,  we  arc  now  ready  to  look  at  several  models 
investigating  the  CEP  and  confidence  interval  of  the  CEP. 


1.3  The  Basic  Problem  in  Estimating  the  CEF. 

The  problem  of  estimating  the  CEP  is  essentially  that  of  finding 
the  radius  of  a  circle  with  center  at  (u  ,u  )  such  that  the  probability 
is  .5  that  a  random  point  Ci,Y)  will  lie  inside  this  circle.  This  may 
be  expressed  as 


(1.3)  P^-ux)2+(V-uy)2v<  r*>  f>:^Y(x,y)dxdy 

{1 . 


where  fv  --(x,y) 
is  given *ky 
formula  (l.l)# 
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Iii  order  to  introduce  the  problem,  the  assumptions  will  be  made 


that  the  mean  values  are  zero  (u„-u  -0) ,  that  the  errors  in  the  x  and  y 

X  y  '  4 

directions  are  independent  (  f5  "0),  and  that  the  standard  deviations 
are  equal  (^x  *■  T).  The  probability  statement  is  thus  simpli¬ 

fied  to 


jexp  -  (£ 

r-L 

5,7-4  J 

(1.9)  rfk2  +  y2<  r*j  -  i 

In  order  to  perform  the  integration  let  R2  -  :c2  +  Y2  ,  Tan®  -  Y  , 
Y  -  dsin®,  X  «  IlcosS, 

Then  V (R$ r)  -  f}£„  {(rcosS) (rsin03|  J jdrdfi  where  J  «/ dx/dr  dx/d®/1 
''  •‘»i  7  /  dy/Icr  cly/c'9| 


-r 


iW  r 

(1.10)  r  (i^r  )  «  1  f  fx  exp  /-  r2  )  dr  »  1  -  exp  -r2  "  ,5 

Trr* . J  J  \  TJ  “T 

o  o 


Tlierefore,  the  ChT  -  r  «  1.1774T. 

The  problem  of  estimating  the  CEP  is  thus  one  of  obtaining  a 

function  of  the  n  sample  points  (x^y^)., . ^n*^  which  will 

estimate  the  standard  deviation  T .  The  estimators  are  functions  of 
the  observed  values  which  are  used  to  estimate  the  true  values  of  the 
parameters.  Tor  example,  if  m  points  from  a  sample  are  given,  the 
average  or  mean  value  is  estimated  by 


(1.11)  m  Xj  +  x^  +  Xj  ••••••••••  ♦  x^ 

m 

Tlv  distribution  of  x  becomes  closely  concentrated  about  the  true 
value  u5<  as  m  becomes  lerge. 
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There  ere  many  ways  to  estimate  the  parameters  under  investigation, 
and  it  is  therefore  necessary  to  specify  certain  properties  which  are 
desired  in  estimators.  For  example,  the  distribution  of  the  estimator 
should  be  concentrated  near  the  true  parameter  value.  If  and  are 
different  estimators  of  0  with  density  functions  fj(£j)  and  f2(©^)  88 
shown  in  figure  (9),  then^  *•  a  better  estimator  of  0  thanl^. 


The  Density  Rincons  of  Two  Estimators 
Figure  7 


Other  properties  which  are  desired  in  estimators  are  defined  as  follows: 

1.3.1  Relative  Efficiency,  The  relative  efficiency  of  two  estimators 

I 

is  defined  as  a  ratio  of  the  mean  square  errors  of  the  estima- 

'•  \ 

tors.  That  is  - 

(1,12)  8($,  -  9) 2  ■  R»F,  where  R,F,  is  the  ratio  function. 

If  R,F,<  1,  then$|  is  said  to  be  a  more  efficient  estimate 

of  0  than 

1.3.2  Unbiased  Estimator,  An  estimator,  $  is  said  to  be  an  unbiased 

.  ■'«* 

estimate  of  the  parameter  0  if  E(0)  «  0, 

1.3.3  Consistent  Estimator,  An  estimator  $  is  said  to  be  a  consistent 
estimate  of  0  if  P($  40)41,  as  n4W  , 
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1.3.4  Efficient  Estimator,  The  estimators  which  have  the  smallest 
limiting  variances  are  called  efficient  estimators  of  0. 

The  estimators  which  will  be  used  in  the  first  part  of  this  thesis  are 
shown  in  Table  a. 


Table  a 


Properties  of  the  Estimators  Used  in  Itodels  I  and  II 


Parameter 


Estimator 


Properties 


u„ 

•V 


Unbiased,  efficient,  andj 
consistent. 


(x4  - 


5) 2 


Unbiased,  efficient  and  j 
consistent. 


X 


A  more  detailed  discussion  of  certain  eatinatore  under  special 
assumptions  ie  presented  in  Appendix  B, 


1.4  The  Problem  of  Dependence 

Xn  the  gunner)’ -problem,  Jhe  errors  introduced  in  the  line  of  s  it 
are  due  to  variations  in  the  range  and  projectile  initial  velocity.  The 
error  across  tho  line  of  sight  is  due  to  bearing  errors#  Since  bearing 
errors  and  range  errors  are  independent  of  each  other  due  to  the  fact 
that  they  are  obtained  from  different  sources,  the  mathematical  assump¬ 
tion  is  generally  made  that  these  errors  are  also  statistically  indepen¬ 
dent#  However,  if  we  broaden  the  perspective  to  look  at  tho  major  errors 
introduced  in  e  missile  trajectory*  the  major  errors  in  the  lino  of 
sight  end  across  ths  line  of  sight  are  probably  not  independent  of 
eeoh  other. 

Thie  it  primarily  dua  to  faotora  which  did  not  especially  influence 
the  gunnery  fire  oontrol  problem  aueh  aa  errors  in  ship*  a  navigational 


V* 
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position,  errors  introducer!  by  missile  attitude  during  the  time  of 
powered  flight,  especially  at  cutoff,  and  weather  conditions  over  the 
target. 

In  the  .gunnery  problem  there  are  two  types  of  navigational  problems. 
The  first  is  the  relative  problem  of  firing  from  a  moving  object  to 
another  moving  target  where  the  fire  control  problem  is  one  of  obtain¬ 
ing  relative  bearings,  ranges,  courses,  and  speeds.  But  the  firing 
ship's  true  navigational  position  relative  to  the  target  is  not  an  influ¬ 
encing  factor. 

The  second  problem  is  one  of  shore  bombardment  where  the  ship's 
navigational  position  is  determined  by  visual  fix.  This  is  closely 
related  to  missile  launching  except  that  the  first  shot  in  shore  bonbard- 
ment  does  not  have  to  hit  the  target  because  the  shore  observer  can  tell 
the  ship  what  spots  to  apply  to  the  generating  fire  control  solution. 
Therefore,  this  again  becomes  a  relative  fire  control  problem  where 
errors  introduced  by  the  ship’s  and  target’s  relative  positions  are 
corrected  by  spotting.  This  is  not  practical  in  long  range  missile 
launching  because  of  the  inability  to  obtain  corrected  visual  naviga¬ 
tional  positions  relative  to  the  target  due  to  lack  of  observers  at  the 
target  area,  Uhat  is  done  instead  is  that  the  probable  errors  must  be 
predetermined  and  enough  missiles  launched  to  give  a  high  probability 
of  destruction  of  the  target  complex.  If  we  assume  that  the  launching 
ship  is  determined  to  be  at  the  launch  reference  point  then  the  errors 
introduced  arc  os  shown  in  figure  (8>, 
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NORTH 

K 


Byrg  **  firing  bearing  from 
ship  to  position  target 
will  have  at  detonation. 

True  Target  Bearing  Diagram 
figure  8 

Byrg  is  proportional  to  f Lxong  ? 

£'Lyon3  5 

Byrg  '  is  proportional  to  C Lxong  +  Lxong? 

(.  Lyong  +  Lyong  * 

Since  Byrg  differs  from  Byrg*  by  the  errors  introduced  in  and 
across  the  line  of  sight,  the  errors  are  reflected  in  the  q(dgm)s  inter¬ 
polation  computer  as  errors  in  velocity  to  be  gained,  which  have  not  been 
entered.  But  the  errors  introduced  arc  not  independent  because  the  in¬ 
puts  influence  changes  in  velocity  to  be  gained  in  both  range  and  cross 
range  directions  as  shown  below? 


13 


Kui/p  — > 


K«yo 


kuxo 


Flow  Diagram  of  the  Change  la  Velocity  to  bs  Gained 

Figure  9 

In  the  gunnery  problem,  the  weather  conditions  over  the  firing 
ship's  position  are  the  same  as  the  weather  conditions  over  the  target, 
therefore  these  values  can  be  accurately  estimated.  The  missile  firing 
ship  depends  upon  intelligence  and  weather  forecasts  to  predict  the 
inputs  for  target  weather  conditions.  This  information  is  therefore  not 
as  accurate  as  in  the  gunnery  problem.  Since  the  errors  introduced  by 
weather  predictions  influence  the  missile  trajectory  over  the  target, 
the  re-entry  body  is  most  likely  to  be  moved  in  any  direction  and  the 
probability  that  the  errors  in  and  across  the  line  of  sight  are  inde¬ 
pendent  of  each  other  is  low. 

The  errors  introduced  by  missile  attitude  during  cutoff  can  best 
be  illustrated  by  u  vector  dlagra. 
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shot  about  the  target  is  a  random  variable  which  obeys  the  bivariate 
normal  probability  laws*  The  assumption  has  been  made  that  the  errors 
in  and  across  the  line  of  sight  are  not  independent  and  one  of  the 
objectives  of  this  paper  is  to  determine  the  effect  of  this  assumption 
on  the  C2P. 


SECTION  II 


DETERMINING  Il:S  CEP  WITH  THE  DENSITY  FUNCTION  CENTERED 
AT  THE  TARGET :  MODEL  I 


2.1  Introduction 

The  most  important  assumption  made  in  this  model  is  that  u  and  u„ 
are  zero.  This  means  that  the  center  of  the  bivariate  density  function 
is  at  the  target,  Although  this  is  the  desired  condition,  it  may  not  be 
true  initially  due  to  the  complexities  of  the  fire  control  problem.  One 
of  the  determinations  that  is  made  from  the  analysis  of  the  firing  data 
is  whether  a  correction  should  be  made  to  the  fire  control  solution  to 
bring  the  distribution  of  the  fall  of  shot  over  the  target.  Therefore, 
by  starting  with  the  assumption  that  the  center  of  the  distribution  is 
at  the  target  and  finding  that  this  assumption  is  wrong,  it  becomes 
necessary  to  determine  and  apply  the  correction  to  the  fire  control  solu¬ 
tion.  Also,  it  should  be  noted  that  although  this  assumption  may  not  be 
true  initially,  it  still  may  be  true  after  correcting  the  initial  fire 
control  solution. 

If  the  center  of  the  distribution  is  close  to  the  target,  (0,0)  in 
the  coordinate  system,  or  suspected  of  being  so  by  analysis  of  the  test 
data,  the  estimators  determined  from  this  model  may  be  better  estimators 
than  the  estimators  used  in  Model  II  in  Section  III.  A  comparison  can  be 
made  between  model  I  and  model  II,  using  the  criterion  of  relative  effi¬ 
ciency  to  determine  which  model  is  theoretically  the  best.  This  criter¬ 
ion  is  explained  in  Section  III. 

In  this  model  the  errors  in  the  x  and  y  direction  are  assumed  to 
he  non-independent  and  distributed  in  accordance  with  the  bivariate 
probability  Iswt.  The  probability  that  a  random  point  0',Y)  will  lie 
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within  a  circle  of  radius  hy  x  is  equal  to 


(2.1)  F(k,^> -P ^2+V^k7*max) “jj/fX,Y(X*y) dxdy“jJ-^iTpg  exP  -%2'AZdxdy 


|(  X2+Y2  ^  k  T  max  jx2+Y2  <  kV  max 


where  2  and  A  are  defined  in  (1.7), 

In  order  to  integrate  over  this  form,  it  is  necessary  to  first  make 
a  transformation  to  an  orthogonal  density  function.  The  reason  for  this 
is  that  due  to  the  assumption  of  non- independence  ((Vo),  this  density 
function  is  oriented  along  non-orthogonal  lines  called  the  expected  value 
of  X  given  Y  and  the  expected  value  of  Y  given  X  or  in  simpler  notation 
E(XJY)  andE(Y/X)  as  defined  in  Section  1.2.5,  This  orientation  is 


illustrated  in  figures  11a  and  lib 


Three  Dimensional  Diagram  of 
the  Orientation  of  the  Bivari¬ 
ate  Density  Function  where 
0  1,  a+b+e  -  30°. 


Figure  11a 


Two  Dimensional  Diagram  of  the 
Bivariate  Density  Function 
Formed  by  a  Plane  Parallel  to 
the  x,y  Plane  Qitting  the 
Density  Function, 

Figure  lib 


18 


Three  Dimensional  Diagram  of 
the  .ieoriented  .Axes  of  Che 
Divariato  Density  Function. 

Figure  ITa 

?•  ;hc  T.e.ans  of  this  transformation 
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Two  Dimensional  Diagram  of  the 
Deoriented  Divariate  Density 
Function  Formed  by  a  Plane 
Parallel  to  the  u,v  Plane  Gutting 
the  Density  Function. 

Figure  12b 


are  contained  in  Appendix  A. 5. 
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This  transformed  density  function  can  be  handled  more  easily 
because  the  terms  involving  the  correlation  coefficient  have  been  re¬ 
moved.  The  probability  that  a  point  (U,V)  in  the  new  coordinate  system 
will  lie  within  a  circle  with  center  at  the  origin  and  radius  k^is 


(2.5)  P(k,^,^)-  /7g  (u,v)dudv  -  P(ktc)-_  1  ffexp- 

/J  L,v  ifbih 1 1 


where  c  ■  Sf. 

Vu 

(2.6)  F(k,c) 


This  form  is  simplified  in  Appendix  C.5  to 


o 


The  values  of  (k)  for  various  values  of  P(k,c)  and  (c)  are  tabulated  in 

tables  one  and  two.  Table  one  is  used  by  entering  the  table  with 

c  »  5  In  order  to  find  k.  This  table  can  only  be  used  for  P(k,c)-.5. 

% 

Table  two  is  used  by  entering  the  table  with  c  -.5  and  the  probability 

T* 

P(k,c)  in  order  to  find  k.  This  table  can  be  used  for  various  values 
of  P(k,c) . 


2.2  Estimating  the  CEP  using  Model  1 

The  first  step  is  to  find  estimators  forTx,V~y  and  (°  from  the  n 
observed  points  (Xpy1),.,.v,*.«.(xn,yn).  This  is  done  by  computing 
the  sample  variances  'Tx^^y^,  the  sample  covariance  Vxy,  and  the  sample 
corrclatlonjcoeff iclont  r  which  are  defined  as  fojlowsi 

t  •  h 

Til  7f  £  _  _ 

In  these  formulas,  Vx,  v  y,  and  vxy  are  unbiased  estimates  of  vx,vy, 

and  T xy . 
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ihe  transformed  estimates  of  the  variances  are  computed ‘next. 


*Y 


..  Tx  *V‘y3'  *[&*-&? 

U-,7)  \u  -  ^ 

£*  .  vLt  w*  -kfr-fc-fifr,}  , 

v*  ‘  a.  * 

Table  two > is  entered  with  P(k,c)  -  .5  and  c  »  ^  to  find  k. 

^  a  V« 

The  estimate  of  the  CEP  *  CSP^  ■  k  ^ 


2.3  Estimating  the  CEP  using  the  Assumption  that  the  Errors  in  the 
x  and  y  Directions  are  Independent. 

If  it  has  been  assumed  that  the  errors  in  the  x  and  y  directions 
are  independent,  an  estimate  of  the  CEP  can  be  obtained  by  using  the 
estimators  in  model  I  except  that  the  estimated  variances  Vs  and  V-  $ 

rr  2  ^  2 

are  used  instead  of  the  estimated  transformed  variances  v  u  and  v 

a  /N  »  Ci- 

c*  ■  Tmin  where  Tmin  -  Min(vx.vy) 

■^max  ^max  -  Max(^  x,^“y) 


P(k*,c*)  -  .5  . 


Table  two  is  entered  with  P(k*,c*)  and  c*  in  order  to  find  k*. 

A 

Then  this  estimate  of  the  CEP  ■  CEPJ  ■  k*  V  max, 

2.4  Information  About  the  Problems. 

In  the  problems  which  follow,  both  estimates  of  the  CEP  will  be 
obtained  in  order  to  compare  the  results  in  the  summary  in  Section  VI. 


2.5  Example  Problems 

The  problems,  which  will  be  used  to  compare  methods  of  estimating 
the  CEP,  have  been  set  up  in  three  cases.  The  first  case  will  have  ten 

sample  points  (x^y^) . .....(x^y  )  and  is  representative  of  the 

point  in  time  where  some  initial  decision  may  be  made  as  to  whether  the 
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stem  should  be  accepted,  rejected,  or  that  more  tests  should  be 


coni. t. 


ducted,  The  second  case  will  have  fifteen  sample  points  (x^.y^).,., 

. (xir,yi5)  which  will  include  the  first  ten  sample  points.  This  is 

intended  to  represent  an  intermediate  point  in  time  where  some  terminal 


decision  may  be  made  on  the  acceptance  of  the  weapon  system.  The  third 

cas  >  will  consist  of  twenty  five  sample  points  (x^,y^), . **  ^*25*’*'^  * 

It  should  be  noted  that  as  the  number  of  observations  increase,  the 
estimators  are  more  likely  to  be  closer  to  the  true  values.  The  actual 
distributions  of  the  25  points  are  shown  in  diagrams  1,2,  and  3.  The 
coordinates  of  the  points  are  as  follows: 


I  rob 1cm  1 

Problem  II 

Problem  111 

Case 

X 

y 

X 

y 

X 

y 

1. 

-3.0 

•  1.0 

•5.8 

3.6 

-3.6 

-11. s 

2. 

-2.2 

5.0 

-2.6 

l.G 

-3.6 

3.2 

3, 

-1.0 

1.0 

0 

1.0 

-1,6 

-  .2 

4. 

-  .  u 

-  .6 

1.3 

1.0 

•3.0 

•  ftl> 

5. 

w’ 

3.0  - 

-1.6 

•  1.0 

1.2 

-  2.2 

I 

/• 

“*• 

1.0 

0 

.5 

•  1.0 

0  / 

-  1.2 

7. 

3.6 

-2.0 

3.0 

-  .6 

l.Q 

4,2 

• 

3.0 

1.0 

1  “ 

-2.4 

.4 

1.6 

1 

i 

3.0 

4,0 

|  -1.0 

-4.0 

1.3 

.4 

i  '  * 

! 

ia*j 

1.0 

_ ,,.4  . . 

L-4,0 _ -2,0 
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The  value  of  the  CBP  obtained  using  the  estimators  from  this 
section  will  be  compared  to  the  estimates  of  the  CEP  from  Sections  III, 
IV,  and  estimators  which  are  explained  in  Appendix  B,  This  comparison 
will  extend  to  the  problem  of  rejecting  outliers  and  the  comparison 
will  be  presented  in  Section  VI, 

Although  these  problems  are  primarily  oriented  at  tests  involving 
the  more  expensive  weapon  systems,  such  as  the  IR3K,  the  environment 
can  be  extended  to  less  expensive  weapon  systems  which  will  naturally 
have  more  sample  points.  Although  it  was  intended  to  make  the  problems 
as  realistic  as  possible,  no  attest  was  made  to  utilise  data  from 
actural  missile  firings. 
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i-roblem  III,  Case  I.  Data  points  and  computational  results. 
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SECTION  III 


DETAINING  THE  CEP  WHEN  THE  DENSITY  FUNCTION  IS  CENTERED 

AT  THE  POINT  (u  ,u  ) !  ID  DEL  II 
x  y 

3.1  Introduction  ; 

The  tnost  important  assumption  made  in  model  II  is  that  if  an 

infinite  number  of  tests  were  conducted,  the  mean  values  of  x  and  y 

would  be  u  and  u  respectively.  This  means  that  the  center  of  che 
x  y 

bivariate  normal  density  function  is  at  some  point  (u  ,u  )  with  respect 

x  y 

to  the  target  at  (0,0). 

If  enough  tests  have  been  conducted  to  ascertain  that  this  density 
function  is  offset  from  the  target  through  the  utilization  of  the  esti¬ 
mators,  then  it  may  be  possible  to  enter  a  spot  (-u  ,-u  )  to  correct 

x  y 

the  fall  of  shot. 

In  this  model  the  errors  in  the  x  and  y  directions  are  assumed  to 
be  non-independent  but  are  distributed  in  accordance  with  the  bivariate 
normal  probability  laws. 

The  probability  that  a  point  (x,y)  whose  coordinates  are  chosen 

at  random  will  lie  within  a  circle  of  radius  k^max  with  center  at 

(u  ,u  )  is  equal  to 
x  y 

(2.1)  1  [/(X  -ux)2+(Y  -uy)2  ^  kV'maxj-  j^v .  y(x,y)<ixdy  - 

f  (x-u  )2+(y-u  )  2  /  jt  Wax 
1  x  y 


l  Z’A^ixdy 

/  ,x-uv)2  +  (y-u  )2<  kTmnx 

*  / 
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In  order  to  integrate  over  this  form,  it  is  necessary  to  first 
translate  the  axes  before  making  the  transformation  because  the  density 
function  is  oriented  along  non-ortho gonal  lines  away  from  the  center  of 
x,y  coordinate  system.  This  orientation  is  shown  in  figures  13a  and 


ellipse  formed  by  a  Plane  Parallel  to  the  x,y  Plant  Cutting  the  Density 

function  with  Center  at  (u  ,u  ) 

x  y 

figure  13b 
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The  translation  is  made  by  subtracting  the  means  (u  ,u  )  from  their 

A  y 

resnective  random  variable  X  and  V.  That  is  simply  (X  »  uv)  and  (”  ■  u  ) 

y 

where  in  this  case  the  matrix  2  now  becomes  2 

I 

Y  -  u 


The  transformation  is  then  of  the  same  form  as  the  one  in  Section  II. 


3.2  Estimating  the  CEP  Using  2-iodel  II 

The  first  step  is  to  find  estimators  for  u  ,u  ,  (T  »T  »  ant*  1°  from 

^  y  X  ’y 

the  n  observed  points  (x, ty.) (x.,y  ) . . (x  ,y  ).  This  is  done  by 

44  n  n 

first  computing  the  sample  means  x,y  and  then  computing  the  sample 

/A  ,  ^ 

variances  V*  /  4  the  sample  covariance  'Tlryand  the  sample  correlation 


coefficient  r  as  follows: 


x  " 


n 


- 

V  K3  —A 

*  n 


S'xi-x>' 

rt-  — — 


.  •  >->2 


n-1 


x)  ^i  *  y> 

T**m  - set - 


£  _  Txy 


T*  Vy 


The  transformed  estimates  of  the  variances  are  then  computed  using 
formulas  (2.6),  Table  1  or  2  is  entered  with  P(k,c)  ■  .5  and 

c  ■  Ty  to  find  k.  The  estimate  of  the  CEP  »  “Set.  ■  kV^  , 

“  u 


3.3  Estimating  the  CEP  Using  the  Assumption  That  the  Errors  in  the 
x  and  y  Directions  Are  Independent. 

If  it  has  been  assumed  that  the  errors  in  the  x  and  y  directions 

arc  independent,  an  estimate  of  the  CEP  con  be  obtained  by  using  the 

^  ^  • 

estimators  in  todel  II  except  that  the  estimated  variances  Txz  andT2 


fr-51  rr  •** 

are  used  instead  of  the  estimated  transformed  variances  Vy  and  » 

^  A 

V"  min 


Then  c*  - 


^  max” 


where 


5L  =  <*<#  (G,^) 


V*m*x  /hax(W/  5y) 


Table  1  or  2  is  entrred  with  P(k*,c*)  and  c*  in  order  to  find  k*. 

£ 

Then  the  estimate  of  the  CSP  is  CEP*0  —  k*7  » 

i  max 

3.4  Comparison  of  Models  I  and  II 

If  Model  I  is  the  true  situation,  then  the  estimator  defined  in 

Section  II  is  the  most  efficient  estimator.  If  the  mean  is  not  at  (0,0), 

(Model  II)  then  it  still  may  be  advantageous  to  use  the  estimate  given 

for  Model  I  if  (u  ,u  )  is  not  too  far  away  from  the  origin  and  if  the 
x  y 

sample  size  is  maall.  This  is  because  tvx>  degrees  of  freedom  are  lost 
in  estimating  (uxtuy)»  This  problem  is  treated  in  Appendix  B  using  the 
criterion  of  relative  efficiency* 

3.5  Problem  Set 
Problem  I,  Case  I 
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Problem  II,  Case  I 
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SECTION  IV 


RS-DVING  THE  OUTLIER:  M03EL  III 


This  model  covers  the  problem  of  outliers  and  attempts  to  show 
some  of  the  reasons  for  eliminating  the  outliers  from  consideration  in 
the  determination  of  the  estimates  as  well  as  several  methods  for 
eliminating  them.  * 


4.1  Introduction  to  the  Froblem. 

The  general  problem  of  removing  outliers  is  related  to  the  fact 
that  it  is  desirable  to  obtain  estimates  of  the  parameters  for  the 
underlying  bivariate  density  function  which  are  not  biased  by  obser¬ 
vations  from  a  distribution  different  from  this  underlying  distribu¬ 
tion.  This  in  turn  will  yield  more  accurate  estimates  of  the  CEP. 

It  is  necessary  to  safeguard  the  estimate  of  the  CEP  from  the  ill 
effect  of  including  information  in  the  analysis  that  is  not  due  to 
variations  in  the  population  of  missiles,  but  is  caused  by  some  other 
factors  such  as  weather  or  human  errors.  It  is  also  possible  that 

observations  which  have  large  deviations  from  the  other  observations 

/ 

may  come  from  different  distributions  due^tsf  inyrovement  in  the 
missile  design.  This  is  especially  tru^ during  the  missile  develop¬ 
ment  stages  where  each  succeeding  missile  has  inproved  or  different 
subsystem  components  than  proceeding  missiles.  Ebr  example,  an  im¬ 
proved  fuel  may  not  be  correctly  compensated  for  in  tho  missile  guid¬ 
ance  end  fire  control  computers  or  a  new  type  switch  may  not  function 
quite  os  initially  designed.  Thu  combination  of  changes  nay  influence 
the  range  of  the  missile  so  that  it  loads  farther  from  the  target  than 
predicted.  If  compensation  Is  correctly  made  for  the  succeeding  shot, 
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it  scons  reasonable  that  the  observation  for  the  first  shot  should  not 
be  included  in  the  determination  of  estimates  for  the  CEP. 

Also,  as  improved  subsystems  are  added  to  the  missile,  it  is 
possible  that  the  earlier  missiles  will  not  have  the  same  density 
function  as  the  later  missiles  and  thus  have  a  different  CEP.  In  this 
case,  it  may  become  necessary  to  include  only  the  later  developed 
missiles  in  the  determination  of  the  CEP.  Due  to  the  fact  that  the 
missile  development  will  be  a  continuing  process  with  each  missile 
slightly  different  than  the  proceeding  one,  it  may  not  be  easy  to  dis¬ 
tinguish  between  these  distributions.  This  is  because  both  distribu¬ 
tions  will  have  some  observations  close  to  the  target  and  others  away 
from  the  target.  The  figures  below  may  help  to  illustrate  this  point. 


Observations  from  Two  Density  functions  of  Two 

Distributions  about  the  Target  Distributions  about  the  Target 

::  first  population 

0  second  population 

Figure  14a  Figure  14b 
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It  should  he  noted  in  figure  14b  tlvatdistribution  I  has  some 
probability  of  occuring  in  distribution  II,  If  this  probability  is 
large,  it  may  be  extremely  difficult  to  separate  the  two  distributions. 
In  fact,  if  it  is  desired  to  separate  the  two  distributions,  there  is 
some  probability  that  observations  belonging  to  the  underlying  distri¬ 
bution  under  consideration  will  be  removed  along  with  the  observations 
from  the  distribution  that  is  not  being  considered.  Thus  one  of  the 
problems  in  removing  outliers  is  to  keep  the  probability,  that  the 
observations  removed  as  outliers  which  do  in  fact  belong  to  the  under¬ 
lying  distribution,  as  low  as  possible.  If  this  probability  is  small, 
it  is  possible  that  the  observations  belonging  to  the  underlying  dis¬ 
tribution  which  are  still  removed  will  have  such  a  low  probability  of 
occurence  that  their  removal  will  still  lead  to  a  better  estimate  of  the 
parameters.  This  may  be  especially  true  for  small  sample  sizes  where 
one  sufficiently  large  or  small  observation  can  totally  ruin  an  analysis 
of  the  data.  Therefore,  in  order  to  eliminate  an  arbitrary  result,  it 
is  necessary  to  establish  some  criteria  for  eliminating  these  outlying 
observations. 

4,2  Criteria  for  Rejection  of  Outliers 

Naturally  shots  which  land  at  long  distances  from  the  target  can 

be  easily  identified  as  wild  shots  or  outliers  with  possible  unknown 

errors,  3uc  as  the  observations  move  closer  to  the  target,  it  becomes 

necessary  to  utilize  some  type  of  probabilistic  consideration  for  the 

rejection  of  outlying  observations.  One  way  to  approach  a  solution  to 

this  problem  is  to  set  it  up  as  a  hypo  Cues is  testing  problem.  Cn  the 

basis  of  the  observations  (*, ,y,).. ,y_),  a  test  is  made  of 

11  n  n 


t . 
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the  hypothesis  that  the  observed  point  (x^yj)  belongs  to  the  underlying 
distribution.  The  test  is  then  conducted  for  each  (xj,yj)  for  i  ■  1,2, 

. .  one  at  a  time.  The  alternate  hypothesis  is  then  that  the 

observed  point  (x^y^  does  not  belong  to  the  underlying  distribution 
but  to  some  different  distribution.  This  can  be  written  as: 

H0:  fx,Y^Xi»yi5  “  £X0  Y0^Xi*yi^  f°r  each  1  "  1 . n» 

"l!  >  {%tY0<Vyl)  ”he«’  \x0(xi  i>  u  che 

true  underlying  distribution. 

The  probability  of  a  Type  I  error  will  be  called  v  x*here  v  is  the 
probability  of  rejecting  the  hypothesis  that  the  point  oes 

belong  to  the  underlying  distribution  when  in  fact  it  does  belong  to  the 
underlying  distribution.  This  can  be  expressed  as 

Prob  [jfype  I  error}  -  v 

The  probability  of  accepting  the  hypothesis  that  some  point  (x^y^)  does 
belong  to  the  underlying  distribution  when  the  point  does  not  belong  to 
the  underlying  distribution  and  is  called  the  Probability  of  a  Type  II 
error. 

Thus  the  probability  of  the  Type  I  error  may  be  called  the  risk 
that  the  experimenter  is  willing  to  take  in  making  a  mistake  by  rejecting 
a  point  (X|,y})  as  an  outlier  which  does  in  fact  belong  to  the  under* 
lying  distribution  even  though  the  observed  value  does  exceed  some  value 
specified  by  the  criteria.  Naturally,  it  is  desirable  to  try  to  keep  v 
small  but  if  v  is  too  small  then  the  Typo  II  error  will  increase  and  all 
outliers  will  be  included  in  the  determination  of  the  parameters. 
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4.3  Method  I  For  the  Rejection  of  Outliers. 

This  method  for  the  rejection  of  outliers  is  based  on  the  probability 
that  a  random  point  (X,Y)  will  lie  within  the  ellipse  Z’AZ  -  k.  Z'AZ  is 
the  matrix  notation  for  the  quadratic  form  of  the  dependent  bivariate 
normally  distributed  random  variables  X  and  Y.  That  is, 


(4.1) 


k  is  defined  by 


(4.2) 


P  (  Z*  AZ  s<  k2)«  ff  f„  v(x,y)dxdy»l. 
Z»AZ<k2 


Geometrically  it  is  the  probability  that  the  point  (X,Y)  will  lie 


inside  the  ellipse  made  by  a  plane  parallel  to  the  x,y  axes  cutting  the 


density  function  as  shown  in  figure  15. 


Offset  ellipse  Made  by  a  Plane  Parallel  to  x,y  Axes 
0uttin3  the  Density  Function 


Figure  15 


Due  to  the  orientation  of  this  density  function,  it  is  necessary  to 
make  the  transformation  to  the  orthogonal  u*v  coordinate  system  in  order 
to  integrate  over  this  form.  This  transformation  is  made  in  the  same 
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manner  as  in  Sections  II  and  III,  The  probability  can  now  be  expressed 


as 


(4.3)  P(W»A*.i<kf>-  jjg,,>v(u,v)dudv  -  jfex pC-ljWA^dudv 

WA*»i<b  u '  WA*V/<tf 

where  +  v^ 

V,4  Vy4 

letting  T2  -  +  v£  ,  (4,3)  reduces  to 

(4.4)  P(T2<k2  )  »  f  J<exp (-%t)  dt  3 

The  random  variable  T  has  the  Chi  Squared  distribution  with  two  degrees 
of  freedom.  The  above  formula  is  a  special  case  of  the  following  result. 
If  2^  are  independent  and  normally  distributed  random  variables  with 
means  Uj  and  variances  ,  then 

a. 


«-5>  T,  -|(vp)i 


The  degrees  of  freedom  m  is  the  number  of  independent  terms  in  the  sum. 
The  density  function  of  Tm  is 

cx/>(“£0 


(4,5)  fT  (t)  -  — - imr 

m  (?-#)/ 


t  >  0 


t  i  0 


The  areas  under  this  density  function  are  partially  tabulated  in  Table  4. 

The  desired  percentage  of  the  area  under  this  curve  is  found  by  entering 

Table  4  with  1  •  v  and  the  degrees  of  freedom  m. 

The  decision  rule  that  is  used  for  the  elimination  of  outliers  is 

to  state  that  an  observation  is  an  outlier  when 

3  "Ir.c.-c;*.  rtion  to  the  Tl.eo.y  of  Statistics"  by  A.  iiood  /2/  of 
Hand  Cnipor  l'  ion. 


4.4  Method  II  For  the  Rejection  Of  Outliers. 

nils  method  for  the  rejection  of  outliers  is  based  on  the  probability 
that  a  random  point  (X,Y)  will  lie  within  a  circle  of  radius  k 7~mav» 

Then,  letting 

r  -  £(x  -  ux)2  +  (y  -  Uy)^J,  k  is  defined  by 

(4.8)  P  [[(X  -  ux)2  +  (Y  -  uy)2J<kArJ-  Jfx^(x,y)dxdy  =  1-v 

Vmojt 

Geometrically,  it  is  the  probability  that  the  random  point  (X,Y)  will 
lie  inside  the  circle  Imposed  on  the  quadratic  form  made  by  a  plane 
parallel  to  the  x,y  axes  which  cuts  the  density  function  as  shown  in 
Figure  16. 
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Illustration  Showing  the  Circle  of  Interest 
which  is  Imposed  on  the  Ellipse  hade  by  a  Plane 
Parallel  to  the  x,y  Plane  Cutting  the  Density  Function 

Figure  16 


Due  to  the  orientation  of  this  density  function,  it  is  also  neces¬ 
sary  to  make  the  transformation  to  the  orthogonal  u,v  coordinate  system. 
The  geometrical  areas  under  consideration  are  shown  below  in  figure  17 
for  this  transformed  density  function. 


Illustration  Showing  the  Circle  of  Interest 
which  is  Imposed  on  the  Ellipse  made  by  a  Plane 
Parallel  to  the  u,v  Plane  Cutting  the  .-ensity  Function 

Figure  17 
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It  should  be  note!  that  this  method  will  reject  points  outside  the 
circle  but  inside  the  ellipse  v.’hich  is  estimated  from  the  data  points. 
Therefore,  unless  the  variances  are  equal,  this  method  will  generally 
reject  points  farther  from  the  target  than  method  I,  since  some  points 
on  or  near  the  major  axis  will  be  outside  the  circle  os  shown  in 
figure  17.  The  circle  is  necessarily  of  smaller  diameter  than  the 
major  axis  of  the  ellipse  unless  the  variances  are  equal  and  then  the 
circle  and  ellipse  will  be  synononous.  This  can  be  seen  from  the  follow¬ 
ing  inequality: 

(4.9)  X2  +  V2  ■>,  X2  +  Y2  where  CT2.  ■  max  (7x2.  \/y2) 

jp  f 72  *  "ff*- 

J  V  max 


The  probability  that  the  point  (l’,V)  in  the  transformed  coordinate 
system  will  Me  within  the  circle /d 2  +  V2  ■  k^ij  is  expressed  as 


(4.10)  i'[ju2+V^  kVi J 


F(k,c)  where  c- 


This  formula  is  the  same  general  formula  that  was  used  for  the  determin¬ 
ation  of  the  CSP  except  that  .5  has  now  been  replaced  by  (1-V)  in  the 
range  from  ,5  -*1.  The  decision  rule  that  is  used  for  the  elimination 
of  outliers  is  to  state  that  an  observation  is  an  outlier  when 

(4.11)  »  Z'AZ^kj  where  kj  is  obtained  from  table  2  by  entering 

with  1- y  and  the  value  c-  $v  •  It  should  be  noted  that  this  value 

Vu 

of  k  defines  the  radius  of  the  circle  centered  at  (u  ,u  )  which  includes 

x  y 

(1-V)100T,  of  the  bivariate  probability  mass.  The  value  of  k  obtained 
from  method  I  defines  the  ellipse  which  includes  (l-\/)1007.  of  the 
bivariate  probability  mass. 
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■'..5  Irocec'  re  far  xcmovtng  Outliers  '.’sin'*  :  ctho tl  I  or  ..ethod  II. 

al  procedure  to  remove  the  outliers  differs  from  the  :'is» 
.".sio:-.  in  Sections  4,3  and  4.4  in  that  the  probability  l-/is  only 
exact  if  the  true  values  of  the  parameters  ux>u,,»  7"^*  \Ty  and  ^  are 
used.  It  should  be  noted  that  both  procedures  substitute  estimates  of 
these  parameters  for  the  true  values  and  therefore  the  probability  of 
Type  I  error  is  not  exactly  equal  to  V  .  .'he  first  step  is  to  find 
estimators  for  uj(Su and(°  from  the  n  observed  points  (x^ty^).,, 

. ... (x  ,y  ) .  This  can  be  done  using  either  model  I  or  model  II  from 
L'ections  II  and  III  respectively.  The  model  used  depends  on  which  basic 
assumption  is  made  about  the  true  values  of  the  means  (ux»Uy).  If  it 
is  assumed  that  ux“Uy«»04  then  model  I  can  be  used.  If  it  is  assumed 
that  uv.  i4  0,  and/or  u  /*0,  then  model  II  can  be  used.  Also,  the  criterion 
of  relative  efficiency  can  be  used  to  determine  whether  model  I  or 


model  II  should  be  used.  The  estimates  of  the  parameters 

oy.  »**»''*»  are  then  computed  by  using  the  selected 


model. 


The  estimated  value  of  the  matrix  A  is  computed  next  using  the  above 
estimates. 


(4.12) 


-A 

•  ”  1 


5- 


*"r  -£  J-  J 

\  fry  ' 

hormolly  the  value/  Is  predetermined  by  the  experimenter  and  the 
outlier  rejected,  on. the  basis  of  this  value.  It  is  advisable  to  delete 
the  outliers  one  at  a  time  until  all  of  the  data  points  are  inside  the 
region  prescribed  by  the  probability  !-/ and  the  method  used.  This  is 
due  to  the  fact  that  the  estimated  shape  of  the  curve  is  dependent • upon 
the  data  points  and  each  deleted  point  will  produce  some  change  in  the 
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* 


•  stiuaiu-'  shape  o  '  ulio  density  Junction,  .lie  first  outlier  is  removed 
by  investigating  the  points  farthest  from  the  estimated  mean  value 
and  the  point  (x^y^)  is  deleted  whose  estimated  quadratic  form 
is  greater  th.an  kj  (for  method  I)  or  k|  (for  method  II).  If  there 
are  two  or  more  points  which  satisfy  this  requirement,  the  point  is 
deleted  first  which  has  the  greatest  valued  quadratic  form. 

It  is  then  necessary  to  recompute  the  estimators  and  use  the  above 
procedures  again,  thus  removing  outliers  one  at  a  time,  until  there  are 
no  points  left  with  estimated  quadratic  forms  greater  than  (i**lor2) . 
The  final  estimate  of  the  CEP  is  then  determined  from  the  estimators 
derived  using  the  data  from  the  remaining  observations.  This  estimate 
of  the  CEP  will  be  referred  to  as  ^2i  where  the  subscript  i  refers 
to  the  number  of  data  points  removed. 

4.6  Information  About  the  Problems, 

In  order  to  illustrate  the  above  methods,  the  sample  problems  given 
in  Section  2,5  were  used,  1-todel  II  was  chosen  arbitrarily  for  estima¬ 
ting  the  parameters  for  illustrative  purposes,  3oth  methods  of  reject¬ 
ing  outliers  were  set  up  for  each  problem  case  but  instead  of  rejecting 
outliers  with  any  specific  probability,  the  tables  were  set  up  to  show 
the  probability  that  a  specific  data  point  could  be  rejected.  This  was 
done  in  order  to  compare  the  two  methods. 
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iroblem  I,  Case  I.  Hat  a  points  and  computational  results. 
I  9 


^  I 


© 


/ 


i 


£ 


Al  i 


T 


Data  Points  in  Problem  I, 
•„  Diagram  22 


=10 


Step  1.  In  order  to  reduce  computa¬ 
tion,  it  is  only  necessary  to  find  the 
maximum  values  of  in  each  of  the 

steps  in  removing  the  outliers. 


lO^lo”5, 

27,  ^AZ2»3.32, 

Z'AZ  -3.31 
l  1 

1-V 

method  1 

*T  | 

1 

.bet hod  2 
ka 

<S 

.90 

4.61 

- 

os 
. .  j 

i 

5.99 

4.5  ; 

.975 

'  7.33 

5.65  ; 

Conclusion:  Z'fc  for  point  10  is  gi*eatest  and  can  be  removed  v?ith 
‘OS  probability  by  Method  1  and  95 %  probability  by  Method. 2. 


x 


The  xt computed  estimators,  after  deleting  point  10  are  then 
•  %  y  "  1.2,  ^  ■  2,94,  Vy  ■  2.33,  ■  .41,  P  ■  .06,  VJ,  •  2.9,^* 


2.4 


Dependent 

Independent 

ijodel 

:odcl  ' 

c  ■  ,31 

c*  -  .SI 

k  -  1.05 

k*  -  1.05 

CSP21-3.1 

CSr*21"2«6 

Step  2.  Vhe  procedure  should  now  be  continued  with  the  9  retaining  data 
points  to  determine  if  any  of  the  remaining  data  points  can  be  removed 
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with  o  spcci.:ic.'  probability  oT  90!»  veins  tr.c tliod  1  or  . 5'.',  probability 
usins  lathed  2,  In  this  p  rob  lot:  there  arc  no  i:«rc  outliers. 


Problem  I,  Case  II.  Da ta  points  and  computational  results* 


Data  Points  in  Problem  1,  >15 


Diagram  23 


Conclusion!  2'&  for  data  point  13  is  greatest  and  can  be  removed 
with  .7,57.  probability  by  Method  1  usd  Method  2, 

The  recomputed  estimators  after  deleting  point  15  arc  then 

*  -  1.2,  y  «  1,6,  $»•  2,62,  Vy-  3.44,  $y»  3.45, -  3.63,  %  -  2.26, 

* 

P  -  .333 


dependent 

;odd 

o  •  ,632 

k  -  ,‘,54 

d*.-3,41 


Independent 

.odcl 

C*"  .762 

UW.03V 

3.fd 
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deep  2,  .lie  procedure  is  now  continued  with  the  14  remaining  data 
points  to  determine  if  any  of  the  remaining  points  can  be  removed  with 
a  specified  probability  of  :'5Z, 


zh>\  ■  5-M 

1-V  _ 

Method  1  | 

Method  2  j 

j 

.95 

5. 95 

4.41  1 

L  .3”  _ 

7.38  , 

5.54 

Conclusion)  Z'A 2  for  data  point  10  is  the  largest  and  can  be  removed 
with  0 57.  probability  by  method  2  but  would  not  be 

renioved  ns  an  outlier  by  method  1.  For  purposes  of  illustration,  this 
data  point  will  bo  removed.  The  recomputed  estimators  after  removing 
point  13  arc 

x  -  .3,  y  -  1.7,  %*  2.47,$-  2.71,  .  51,  t"  2.79,  V"  2.37, P-  .136 


dependent 

Independent 

lodel 

Model 

o 

■ 

u»  j 

c^-.vr 

;  k  -1.09 
, 

!«*- 1,  /& 

1 

C2l>22-3.04 

c»*  - 

22 

Step  3,  The  procedure  is  again  continued  with  the  13  remaining  data 
points  to  determine  if  any  of  the  remaining  points  can  bo  removed  with 
a  specified  probability  of  23Z»  In  this  example  there  ore  no  more  outliers. 
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i'roblem  II,  Case  I.  Data  points  and  computational  results 


Data  Points  in  Problem  II,  M— 10 
Diagram  24 


tep  1, 

Z'AC  -  6. 1C 

1  1 

ji-v 

Method  1 

Method  2 

1.95 

| 

5.99 

- 

1.975 

i 

;  7.33 

5.22 

1.99 

6.35 

Conclusion i  l*tz  for  point  1  is  groatest  and  con  be  removed  with 
-57,  probability  by  asthod  1  and  91*%  probability  by  method  2. 


:ho  recomputed  estimators  after  deleting  point  1  are 

S  ■  -.3.  7  -  2,15,^-  1,32,  .79,?-  ,201,  2.23, 

Ty-  1.72 


Dependent 

Independent 

l&del 

Model 

c  -  ,770 

c*-  ,  w 

% 

k  -1.039 

k  *"1,08$ 

$S’2l-2,3l 

a.3v 

3tep  2*  Tit*  procedure,  using  tho  9  remaining  points,  does  not  reject 
any  rare  dots  points  in  this  problem. 


© 


dose  il,  ;aca  points  and  computational  results. 


?  ; 


;tep  1, 


:|AZ  -  7.33 


Data  Points  in  Problem  II,  N-15 
Diagram  25 


Conclusions  D'ftz  for  point  1  is  greatest  and  can  be  removed  with 
957.  probability  by  method  1  and  99fi  probability  by  mathod  2. 

The  recomputed  estimators  after  deleting  point  1  are 

y.  -  -.5,  y  ■  2.16,^-  2,60,^-  -.2,  (  »  -.03, 5*-  2,69,^-  2.13 


Dependent 

Independent 

lodel 

Model 

c  -  .003 

c*-  ,90 S’ 

V 

U  -1.076 

k 

S'2l-2,?0 

cdi*  -a. fA 

L  21 

3 top  2.  The  procedure  using  the  14  remaining  points  docs  not  reject 
any  moro  data  points. 
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5 

i 


Data  points  in  Problem  III,  II— 10 

uiasra^  26 


Concl’-uion:  Z*.\ Z  cor  point  1  is  greatest  and  can  be  remove. 1  with 
OT  probability  by  method  1  and  97. 5^  probability  by  method  2, 


:»t«p  2*  The  pruceU-.tro  a.iin^  the  d  remaining  points  do ao  not  reject 
any  mote  data  peirtc. 
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jia$rar  27 

Conclusion!  2*^2  for  point  l  is  sreatest  and  can  be  removed  with 
greater  than  97.5’;  probability  Tor  both  methoJs,  Also,  Foint  10  can 
be  removed  with  05?.  probability  by  asthod  1  and  99%  probability 
by  nthod  8.  In  this  problem 

both  points  were  removed  in  this  step. 

'ho  recomputed  estimators  after  deleting  points  1  and  10  arc 


'•'hi:  procedure,  uslnp  thv  13  rc-.vair.in;’,  joints,  docs  not  reject  any  no re 
data  points. 
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SECTION  V 


THE  CONFIDENCE  INTERVAL  OF  THE  CEP 

5,1  Introduction 

The  previously  introduced  estimates  of  the  CEP  are  all  called 
point  estimates  where  the  estimate  of  the  CEP  was  defined  by  the  locus 
of  a  point  moving  at  a  constant  distance  (the  radius)  from  a  fixed 
point  (called  the  mean  or^ju  )).  This  constant  distance  or  radius 
is  called  the  CEP.  The  confidence  Interval  of  the  CEP  attempts  to 
give  some  measure  of  the  possible  error  in  the  estimate  of  the  CEP. 

The  confidence  is  defined  as  the  probability  that  the  true  value  of 
the  CEP  lies  in  on  interval  between  1^  and  where  and  are 
functions  of  the  random  observations  (X^Yj),  i  -  l,2,..*..n.  This  ex¬ 
pression  in  probability  notation  is 

(5.1)  P[t1(Xl...Xri,Y1...Yft)4CSPiL2(Xl...XntYl...Yn)J  :=/-* 

This  interval  estimate  is  e  function  of  the  confidence  required,  the 
number  of  observations,  and  the  estimate  of  the  standard  deviation  used, 

5*2  Obtaining  the  Interval  Estimate 

In  order  to  avoid  lengthy  computation  in  obtaining  the  interval 
estimate,  it  is  assumed  that  the  variances  are  equal.  That  is 

T '  .  Sly*  .  V* 

The  CEP  was  defined  in  Section  1.3  as  being  equal  to  kT  where  the 
value  k  is  a  function  of  the  ratio  of  the  variances  and  the  probability 
that  the  mean  centered  circle  oontalns  50%  of  the  bivariate  density  mass.'' 

^  See  Section  2,1 


/ 


Since  the  variances  are  assumed  to  be  equal,  the  ratio  of  the  variances 
is  1,  and  P(k,l)  "  .5,  so  that  k  ■  1.1774  (from  Table  1  with  c  ■  1). 
Although  the  variances  are  assumed  to  be  equal,  the  estimates  of  the 
variances  are  not  necessarily  equal. 

The  estimate  of  the  standard  deviation  will  be  determined  by  the  follow¬ 
ing  two  methods. 


5.2.1  Determining  the  Confidence  Interval,  Method  1 


In  this  method  max[tr^,  wm  selected  to  represent 


That  is 


£  , 

If  yf  £  is  divided  by  the  true  value  of  the  parameter  and  multiplied  by 
n»l,  this  formula  beoomes 


Although  tha  aum  in  (3.3)  will  not  be  an  exact  chi  squared  random  vari¬ 


able  because  it  ie  the  maximum  of  two  chi  squared  random  variables,  an 
approximate  oonfidence  interval  can  be  obtained  by  treating  (3.3)  as 
though  it  were  a  chi  squared  random  variable.* 

The  confidence  interval  defined  by  (3.1)  thus  becomss 


(3.4) 


See  Section  4.3. 


99 


The  values  of  X .  ^  and  K  ,  are  obtained  by  entering 

table  4  with  n-1  and  either  l«o<'2  or  «t/2  respectively. 


5.2.2  determining  the  Confidence  Interval,  Method  2. 

The  estimate  of  the  variance  in  this  method  is  the  average  of  the 
two  estimates.  That  is 

(5.9)  --  if  $(*ds±£  .  i  OL dLl 

Z  A  aL^  A )-!  r>  //-<  J 


If  (5.5)  is  divided  by  the  true  value  of  the  parameter^  and  tnuitiplied 
by  2 (n-1),  the  formula  becomes 

(j.o  Al^il  (rSi-fr)  :  t  +  2  O';  -  v  <l  ] 

where  >tj  and  are  normally  and  independently  distributed  and  x  and  y 
are  the  sample  moano.  This  formula  can  be  reduced  by  letting  the  values 
of  i  range  from  1  to  n  and  the  values  of  J  range  from  n  +  1  to  2n,  Then 
the  formula  becomes 


where  »  x,  for  k  ■  l........n  and  t.  ■  y,  for  k  «  n  +  l,.,.,,.2n  and 

k  k  k  '  k 

there  are  2(n«l)  squares  in  the  aum.  Thus  (5.7)  has  ay£  distribution 
with  2(n*l)  degrees  of  freedom  by  the  definition  given  in  (4*3).  The 
interval  estimate  is  determined  in  the  some  way  as  in  (5,4)  and  the 
formula  booonec 
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.he  values  and  /_<£  are  obtained  by  entering 

table  ■;  vi eh  2(n-l)  and  cither  l-ot'2  or  oC! 2  respectively.  It.  should 
bo  noted  that  this  method  of  interval  estimation  is  not  as  conserva¬ 
tive  as  method  1  because  the  average  value  is  always  less  than  the 
maximum  of  rheraforc,  this  interval  estimate  will  be  smaller, 

5,3  Illustration 

The  estimates  of  the  confidence  interval  of  the  CEP  used  in  the 
following  illustrations  were  obtained  with  the  data  from  Section  3 
and  (I-*'  *  .35,  A  comparison  is  made  between  method  1  and  method  2 
as  well  as  a  variation  of  the  two  methods  where  the  dependent  estimate 

✓v.  A  £ 

of  the  d.il  <  CEI'^  “  hTij)  was  substituted  for  1,1774  It  should  bo 
emphasised  that  none  of  the  distribution  theory  used  in  hot hod  2  holds 
when  *s  il8cc:  for  k  Tn  .  Therefore,  it  is  hard  to  get  a  mathe¬ 
matically  meaningful  comparison  between  these  methods, 

.‘able  !>  shows  the  various  estimates  of  the  CEP.  The  best  estimate 

/s 

o  '  the  CEP  is  most  likely  Co  be  CCi'^  due  to  the  basic  assumptions  of 

dependence  and  unequal  variances.  The  estimate  of  1,1774  7“  is  the 
1  max 

largest  estimate  of  the  CEP  and  therefore  the  most  conservative  estimate 
of  the  CEP. 
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Case  1 
Case  2 
Case  2 


Case  1 
Case  2 


Case  1 
Case  2 
Case  3 


Tables  c  and  c'.  show  the  upper  and  lower  bounds  of  the  confidence 
interval  estimates. 


Table  c 

T  .  f\*  .  _  v 

L1  '“1 . n* 

,  yp...yn)  «*  Lower  Bound  of  the 

Confidence  Interval  Estimate 

Problem 

Method  1  ! 

. - . .  . l 

Method  2 

Table  d 


(Xj. y ^ .  .y.a)  a  "pper  Hounds  of  the  Confidence  Interval  Estimate 


f  lethod 

i 

Method  2 

i.iniLjZi 

C^JnTi 

1,17747"  ^2  (n-1) 

avg 

cS2/2(n-l) 

It  is  noted  that  the  lower  bound  estimates  are  for  all  practical  purposes 
the  same  for  both  methods,  with  the  average  difference  being  only  .01. 
however,  the  upper  bound  differences  show  that  method  1  gives  a  greater 
estimate  with  the  average  difference  being  1.59.  The  lengths  of  the 
confidence  intervals  are  compared  in  Table  e  below. 


Table  e 


Length  of  the  Confidence  Interval  (Upper  bound  -  Lower  bound) 


Problem 


1-2  _ With  CEr2  1-2 

i.efhod  2  I  Difference  hethod  1  I  Method  2  Difference 


Case  1  4.35 
Case  2  3.05 
Case  3  2.10 


It  should  be  noted  that  the  confidence  interval  becomes  smaller  as  thu 
number  of  observations  increase.  This  implies  that  the  true  value  of 
the  CliP  is  more  likely  to  be  within  a  smaller  interval  as  the  number  of 
observations  increase. 

Diagrams  28,29  and  30  show  the  confidence  interval  using  the 
different  estimates.  The  confidence  intervals  were  obtained  by  using 
the  data  from  case  III  of  each  of  the  problems, 

5, A  Conclusions 

a 

Method  1,  using  l,1774^riax  produces  the  largest  estimates  and 
therefore  is  the  most  conservative  estimate  of  the  confidence  interval. 
However,  C2F2  and  1 .1774V  aVg  are  likely  to  be  better  estimates  of  the 
CEP  and  therefore  method  2  or  the  approximate  interval  using  the  depen- 
dent  estimate  C2P2  may  be  the  best  method  for  estimating  the  confidence 
interval.  An  analysis  of  actual  missile  data  should  give  a  more  realis¬ 
tic  insight  into  the  best  choice  of  methods  to  use  in  estimating  the 
confidence  interval.  In  order  to  come  to  any  definite  conclusions  about 
the  different  methods,  some  comprehensive  distribution  theory  problems 
must  be  solved. 
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SECTION  VI 
3'JKTIAilY 

6.1  Introduction 

The  previous  sections  hove  been  concerned  with  the  development  of 
different  types  of  models  and  methods  for  estimating  the  radius  of  the 
mean  centered  circle  which  includes  507.  of  a  bivariate  probability 
mass.  This  section  summarizes  the  different  models  and  methods  used, 
in  the  previous  sections,  and  includes  an  analysis  of  the  results 
obtained  from  problems.  Although  the  sample  problems  do  not  represent 
actual  missile  test  results,  an  attempt  has  been  made  to  make  the  data 
as  realistic  as  possible.  Therefore  an  analysis  of  the  problems 
should  show  certain  relationships  between  the  models  used  to  estimate 
the  CEP  that  would  also  apply  to  actual  missile  test  data, 

6.2  Comparison  Cf  :Y>del  I  With  Model  II. 

The  basic  underlying  assumption  made  in  Model  I  war  that  the  true 
value  of  the  mean  was  located  at  the  target,  (0,0).  Therefore,  the 
CEP  in  this  model  is  defined  as  the  radius  of  a  circle  around  the 
target, 

Tiie  basic  underlying  assumption  made  in  Itodel  II  was  that  the  true 

value  of  the  mean  was  located  at  some  point  (u  ,u  )  away  from  the 

x  y 

target.  Therefore,  the  astimated  CcT  for  this  .iodel  is  the  radius  of 
a  circle  with  center  at  some  point,  (x,y). 

A  comparison  of  the  estimate  of  the  correlation  coefficient  shows 
that  they  change  in  much  the  sane  manner  in  both  models.  As  suspected, 
a  major  difference  between  these  xod?ls  is  in  the  location  of  $  and  y. 
This  is  shown  ir.  ^lagramj  hi,  32.,  and  23  which  illustrate  the  estimates 
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of  che  Cal'.  The  estimate  of  the  Cap  for  problems  2  and  3  is  practically 
fie  sa.r.e  in  all  three  cases.  Therefore,  when  the  center  of  the  distri¬ 
bution  is  near  the  target,  there  is  little  practical  difference  between 
the  tvra  models.  However,  in  problem  1,  the  distribution  of  data  points 
is  around  some  point  (x,y)  away  from  the  center.  If  the  procedure 
given  in  Appendix  3  is  used  to  estimate  the  ratio  function,  then  the 
values  obtained  indicate  that  CEP2  gives  the  best  estimate  of  the  CEP 
for  a  sample  size  of  10  in  problem  1.  Also,  as  the  sample  size  increases 

the  ratio  function  increases,  thus  C2P2  is  also  the  best  estimate  for 

/ 

n  >  10.  The  values  of  R. F.  obtained  for  problems  2  and  3  show  a  pre- 

J 

ference  for  Model  I  for  small  sample  sizes  and  are  very  close  to  1  for 
large  sample  sizes  and  therefore  either  estimate  may  be  used. 

These  problems  tend  to  substantiate  the  fact  that  the  procedure  of 
Model  II  Is  superior  to  the  procedure  of  Model  I  in  large  sample  sizes. 
They  also  suggest  that  if  Model  I  is  used  ir.  analyzing  a  small  number 
of  observations,  it  might  be  advantageous  to  check  the  assumption  of 
mean  (0,0)  by  computing  the  sample  means. 
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6.3  Comparison  Of  The  Independent  And  Dependent  Methods  Of  Estimating 
The  C2F. 

In  the  introduction  to  the  problem  of  estimating  the  CEP,  the 
assumption  was  made  that  the  errors  in  the  x  and  y  directions  were  not 
independent.  Tills  assumption  is  natural  unless  an  apriori  knowledge 
suggests  that  the  errors  in  the  x  and  y  directions  are  independent, 
however,  the  assumption  of  independence  in  the  fire  control  problem  is 
quite  difficult  to  justify  due  to  its  complexity.  Therefore,  it  would 
seem  wise  to  estimate  the  magnitude  of  the  error  involved  in  assuming 
independence  in  order  to  find  out  how  much  difference  this  assumption 
will  mean  in  the  determination  of  the  CEP. 

It  was  shown  in  Appendix  A  that  the  true  orientation  of  the  density 
function  was  related  to  the  correlation  coefficient.  If  the  true  shape 
of  the  density  function  is  oriented  at  some  angle  with  respect  to  the 
x  and  y  axes  and  independence  is  assumed,  the  computed  standard  devia» 
tion  is  not  the  best  estimate  of  the  standard  deviation.  Consequently 
the  independence  assumption  introduces  an  additional  error  in  the 
estimate  of  the  CSP. 

Table  f  is  used  to  illustrate  some  of  the  important  differences 
in  the  results  obtained  from  the  problems  using  the  tw>  models. 
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Table  £ 


Computed  cifferences  Between  ibdsls  I  and  II 


i  Case  I 


DIff  in 
of  C2P  cst.  of 
stand. 
CEP  dev. 

A  * 

3.64  .15 

3.37  .77 

3.23  .32 


3.33  .81 

3.39  1.06 
3.71  .62 


4.66  .69 
3.56  .65 
3.52  .65 


Cst.  Correl. 
of  coeff. 
standi  /\ 
dev.  P 


.33  .45  .095 
.25  -.23  .125 
.27  .22  -.116 


H 


.825  -.34 

.903  !  .06 

.650  I  -.12 


19 


The  table  shows  some  difference  in  the  magnitude  of  the  radius  of 
the  CEP  as  estimated  by  the  two  models  with  the  maximum  difference  being 
•  33/3.37  or  3.371.  Also,  the  trend  in  the  size  remains  constant  between 
the  two  models.  That  is,  as  the  size  of  the  estimated  CSP  changes  in 
one  model,  it  changes  in  the  other  model  in  the  same  direction. 

Srapli  1  shows  a  plot  of  the  percent  difference  in  the  independent 

» 

and  dependent  estimates  versus  the  correlation  coefficient.  It  should  be 
emphasized  that  the  points  on  the  graph  were  obtained  from  data  oomputed 
from  the  sample  problem. 

The  differences  in  the  estimates  of  the  CEP  from  the  problems  are 
shown  in  diagrams  34,  35,  and  36,  Some  of  the  differences  were  so 
small  that  rheso  estimates  were  left  otf.  It  is  interesting  to  note 
that  the  distribution  of  data  ii.  problem  3  shows  almost  perfect  correla¬ 


tion  and  the  estimated  differences  were  also  a  maximum. 
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6,4  JJffecta  Caused  By  The  Removal  Of  Outliers* 


The  most  obvious  effect  on  the  CEP  when  outliers  are  removed  Is 
that  the  CSP  becomes  smaller*  However,  there  are  several  other  effects 
which  are  not  obvious  but  may  be  Important  In  determining  which  estima¬ 
tors  oan  be  used*  Tabla  g,  using  the  sample  problems,  gives  a  comparison 
between  Method  X  and  Method  XX  and  the  estimates  of  certain  parameters 
before  and  affiar  removal  of  outliers* 


Tabla  3 

Bffsots  Of  Removing  Outliers  On  6&2,  P 

,  and  (Vx-7y) 

Problem 

P(TVpe  X  Srror] 

& 

Correlation  a 

1  Difference  in 

Coefficient  P 

Stand.  1 

Dev.CC- 

Method  X 

Method  XX 

Before 

After 

Before 

After 

Before 

Aftei 

PmfJJ 

outlier 

outlier 

outlier 

outlier 

outll 

1  -V;  M 

ramaved 

■RRujT 

X 

Case  1 

.10 

.09 

- — 1 

3.12 

.383 

l 

.039 

HIM 

1 

2 

•10 

.09 

EH 

3.04 

.031 

.136 

1 m 

1 

XX 

Gaea  1 

•09 

.029 

3.33 

2.31 

-.626 

.201 

1 

.81 

.34 

2 

.09 

.009 

3*39 

2,79 

-399 

-.03 

1.06 

.52 

XXX 

Casa  1 

.09 

.029 

4.66 

3.03 

•623 

•306 

.69 

.43 

2 

.09 

.01 

3.96 

2.42 

.903 

.079 

.63 

.29 

The  estimate  of  the  CSP  wee  reduced  by  from  US  to  36%  in  the 
probicme  by  the  removel  of  outliers*  ,  Xf  e  probability  of  the  Type  X 
error  bed  been  speeified  as  *09,  the  point  rejected  as  an  outlier  in 
problem  1  would  net  have  been  rejected  by  the  elliptical  method  but 
would  have  been  rejected  by  Method  XX,  Thla  ie  because  Method  X  end  * 
Method  Xt  cm  net  the  eame  end  will  not  neeeeeerlly  reject  the  eeme 
pelnte  far  the  eame  oonfldenoe  level*  The  effeete  of  removing  outliers 
ere  shorn  in  Diagrams  37*  38*  and  39* 

79 


It  should  be  noted  that  the  removal  of  outliers  may  change  both 
the  correlation  coefficients  and  the  difference  between  the  standard 
deviations  in  the  x  and  y  directions*  This  la  due  to  the  large  effect 
that  an  outlier  has  upon  the  distribution  parameters*  Thus  a  large 
oorrelatlon  coefficient  may  be  due  to  the  presence  of  an  outlier  and 
not  due  to  correlation  between  the  errors  in  the  x  and  y  directions* 
Therefore*  before  the  independent  method  of  estimation  is  rejected*  an 
investigation  should  be  made  for  outliers* 
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'.'able  I  is  nr*  otcerpt  fro.*  a  table  derived  a:  the  Oavnl  '.’capons  Laboratory, 
Oahlgren,  Virginia,  under  the  direction  of  ’.ar*ry  icingarter.  and 
A.  II,  OiOoneto.  Their  table  presents  solutions  for  the  value  of  It 


satisfying 


du  dv  »  F 


where  C  Is  the  circle  +  v>  * 


and  ?  -  OC.OI) .99. 

See  Appendix  C  for  voluiae  of  integration. 
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APPENDIX 


.  .JLiii'l  iA 


noi-i  ai.'D  niAiiSForj -\rion  o?  ths  bivariate  density  function 


A.  I  Introduction  '  ■ 

This  appendix  is  concerned  with  the  orientation  of  the  bivariate 
normal  density  function  ever  the  x}y  plane.  Primarily  this  requires  an 
investigation  of  the  correlation  between  the  random  variables  X  and  Y 
and  once  the  correlation  is  determined,  a  transformation  of  axes  so 
that  the  function  can  be  integrated  more  easily. 


A.2  Orientation  of  the  Axes 

If  the  correlation  coefficient  is  zero,  that  is  the  random  variables 
X  and  Y  are  independent,  the  orientation  will  be  symmetrical  with 
respect  to  the  x  and  y  axes.  This  means  that  a  plane  parallel  to  the 
x  and  y  plane  will  cut  the  density  function  in  the  form  of  an  ellipse 
whose  minor  and  major  axes  are  parallel  to  the  x  and  y  axes.  This  is 
shown  below  in  figure  A.  1,  Note  that  ifVjc»Vy,  the  ellipse  becomes  a 


Orientation  of  the  Ellipse  Jhen  P  »0. 

Figure  A.l 

If  t  ..  t\-r  relation  coefficient  is  not  zero  and  less  than  plus  or 
minus  1,  t  \a  orientation  is  offset  i'rcm  the  x  end  y  axes  in  the  direction 


SO 


4 


T 


in  figures  A.iu  and  A,  2b. 

\ 


/ 


/ 


Orientation  of  the  Ellipse 
when  C<P<  1 


\ 


\ 


("  "I  *  *ns 

i  tit  (  / K,  t-Ay)  •, 


at  ion  of  the  Ellipse 
’.■hen  -KfdO 


Figure  A. 2a  Figure  A. 2b 

The  -error  introduced  in  assuming  independence,  when  the  random 
variables  ::  and  V  arc  not  independent  is  a  function  of  the  correlation 
coefficient  ar.ri  is  due  to  the  true  orientation  of  the  density  function 
with  respect  to  the  x  and  y  axes.  If  it  is  assumed  that  the  errors  in 
the  x  and  y  directions  are  independent  when  in  fact  they  arc  not,  an 
additional  error  will  vc  made  in  computing  the  estimates  of  the  van  races 

A  A 

This  is  due  to  the  fact  that  the  computation  of  will  be  in  the  direc 
tion  of  the  assumed  axes  (x,y)  instead  of  the  direction  of  true  orien¬ 
tation  ii(h|V)  ,3('T|  ') •  Therefore,  it  becomes  important  to  obtain  some 
knowledge  cf  the  true  orientation  in  order  to  obtain  the  best  estimates 
of  the  variance:.'.  This  can  be  done  by  obtaining  estimates  of  the  sngles 
bet  wen  the  assumed  axes  and  th?  true  axes. 

The  following  sections  are  devotee*  to  the  different  possible  orien¬ 
tations  cue  to  "he  different  ranges  of  the  correlation  coefficient. 


*•  - 


bef  4,,'iratien  of  the  Orientation  of  the  .Axes 


A.2.1  Doterj.iair.g  the  True  Orientation  when  the  Correlation  Coefficich 


is  zero. 


If  P  “0,  the  random  variables  X  and  V  are  independent,  Therefore. 
2(X|Y)  *  E(0  and  S ('■.'!■’)  «  C(i’) •  Hus  can  be  proven  by  determination 
of  either  the  value  of  the  conditional  expectation  directly  or  indirectly 
using  the  linear  predictor. 

A. 3.1.1  Direct  Eetermiration  of  the  Conditional  expectation  if  f*  -0. 

fhe  expected  value  of  one  random  variable  given  the  value  of  the 
other  random  variable  was  defined  in  Section  1.2.5  as 

CA.1)  3C!IY)  -  f X £  JX'V)  d*  --  f*  f 

--  W  ~&i  ■'  ife*  I 


Hor  Vy  Xo 


/aF  ^5 


j  /X  , 

dj  Je+tueJ 


tt/  7/oii  tJiX 


The  2 (Y | M)  can  be  determined  in  the  same  way  and  is  equal  to  2(Y). 

A, 3. 1.2  Indirect  Determination  of  the  Conditional  Expectation  Using  the 
Rest  Linear  Predictor. 

The  conditional  expectation  of  one  random  variable  given  the  value 
of-. the  other  random  variable  is  a  linear  function  of  the  known  random 
variable  when  both  random  variables  are  jointly  normally  distributed, 

1’hac  is  the  E(  ’.l  V)  -  AY  ♦  3*  where  A  and  3  are  constants  which  can  be 
determined.  The  linear  predictors  for  the  conditional  expectations  under 
consideration  vijl  be  defined  as  follows: 
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(A.  2) 


(°  -  CCVOC.Y) 


SCO] 

where  C,  COV(X.Y) 

1  “ 

V*y 

rS 

2(V)] 

where  C„  _  CCV(X.Y) 

-  J&L 

VA.t(Y) 

TV* 

from  1.2.5 


In  the  case  where  6  -0  “  C0V(XtY)  .  COV(X.Y)  -  C  -  C  -  C  from  (A.2) 

vWxJ  1  2 


Therefore,  the  results  become  the  same  as  in  Section  A.2. 1.1.  That  is 

(A. 3)  Z(X/Y)  »  2(.'’)  and 
E(Y/X)  «2(Y) 


| 


In  the  case  that  f5  -0,  the  orientation  of  the  axes  will  be  as  shown  in 


Orientation  of  the  Density 
Function  when  the  Correlation 
Coefficient  Is  Zero. 

Fisure  A. 3s 


Orientation  of  the  Axes  uhen 
the  Correlation  Coefficient 
is  Zero. 

Figure  A. 3b 


I 


t 
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A.3.2  Determination  of  the  Orientation  if  P  -  1, 

If  f  »  !,  the  EC-’/ Y)  and  2 (V/X)  mil  lie  along  the  same  axis.  This 
can  be  proven  by  using  the  best  linear  predictors  in  formula  (A. 2)  and 
the  definition  of  the  correlation  coefficient.  That  is 


(\.4)  P  =  i  *  COV(X.Y)  and  therefore  OOV(X,V)  - JVAUO) J VASO) . 


Then  using  formula  (A.2)  E(Y i X) -E (Y)  +Cl t.0-200]  -2 (Y) - Cj_  SCO+C^X 
where 


(A.5)  C,  *  COVC’.Y)  “  VATi(Y)  using  the  result  in  formula  A.4.  Since 
1  7AzBm,n  VA2?xT 


E (.'0 ,  E(Y),  and  j  VAX Cv)  are  constants,  the  random  variable  S(YIM)  is  of 

i  vaITcT 

the  fortr 


(A.u)  E(Y| vihere  A»Cj  and  3-2(Y)-E(a) Cj.  The  tangent  of  the 
angle  between  the  y  axis  and  the  line  2(Yl‘0  is 

(A.7)  d[2(Y/ ■  Tan®  «  C,  ct  [ VAA(Y)  in  the  case  where  P-  1. 

Jx  V  vXam 


The  tangent  of  the  angle  between  the  x  axis  and  the  line  S(MIY)  is 
determined  in  the  same  way  and  in  this  case 

(A,?)  fanfl  -  C,  «  fV/TltT)  in  the  case  where  P  «  1.  Since  fang»f 7A.“(X) 

“  V  vaiTvT  i 


1 _  -  XT®  the  lines  wist  he  the  same. 


i 


Therefore,  in  the  case  that  P  =  1,  the  orientation  of  the  axes  vill  be 


Orientation  of  the  Density  Orientation  of  the  Axes  when 

Function  when  the  Correlation  the  Correlation  Coefficient 

Coefficient  is  1.  is  1. 

Figure  A. 4 a  Figure  A. 4b 

A.3»3  Determination  of  the  Orientation  of  the  Axes  if  Q<(*<  1. 

If  0  <  P <  1,  the*  two  lines  EC’IV)  nr.c!  2(T|")  will  not  be  the  same 
or  perpendicular  and  will  be  oriented  as  shown  in  figures  A.5a  and  A.5b. 
This  can  he  proven  by  using  the  same  method  as  in  Section  A.3.2,  except 
that 

(V->  0  <  ccvfh y)  _  <  1 

VA.I(a)  V.Mi(V) 

It  follows  from  this  that  0  f  0CV(‘(,Y)  <  fVAil(30  /  VAF(V).  Then  using 
formulas  A.7  and  A."  with  the  definitions  of  the  constants  in  formulas 


A.2,  the  fieri ro<‘  angles  are 


the  complete  range  of  possible  values  for  the  tvio  angles  using  A. 9  are 


( A.  1 1 )  0<  Q  <  Tan' 1  j  VAA(V) 

V  VAZ(;$ 


0<  (?  <  'fan™1  /  VAilpO 
V  VAIi(V) 


In  this  case  the  orientation  of  the 


Orientation  of  the  Density 
function  when  0  <  f  4  1. 


figure  A. 5a 


will  be  as  shorn  in  figures 


C.ientation  of  the  Axes  when 
the  Correlation  Coefficient 
is  0  <  f  <  1. 


Figure  A. 5b 


A, 3.4  Determination  of  the  Orientation  of  the  axes  if  -1<.P<  0. 

If  -1  <(*<  0,  it  follows:  from  forinulas  A.9,  A.10  and  A.ll  that 


(A.  12) 


<  0, 


tan 


<  0 


In  this  case  th*  orientation  of  the  axes  will  be  as  shotm  in  figures 
A. On  and  A. 6b. 


Orientation  of  the  Density 
Function  when  -1  <  P  <  0 


Orientation  of  the  Axes 
when  -1  <  f  <  0. 


Figure  A. 5a 


Figure  A. 6b 


A.4  Ill jstrations 

Although  the  true  orientation  of  the  axes  will  not  be  known,  it 
can  be  estimated  by  using  the  various  estimators  shown  in  table  h  below: 


i  Table  h 

l 

U.  — - -  - -  -  - - -  — ■  —  — 


|  estimators  Used  in  Determining  Estimated  Axes  Orientation 


Estimator  used 

■  -  ■  - 

Value  Estimated 

X 

SCO 

X , 

E(Y) 

Wv 

VAR(X) 

Vy 

VAACY) 

ft 

%  . 

VVAR(Y) 

ft* 

OOV(X.Y) 

The  estimated  parameters  in  the  illustrations  which  follow  are 
determined  by  using  the  data  from  the  example  problems  in  Section  II. 
A.4.1  Illustration  (1) 

The  data  is  obtained  from  example  problem  no.  1  with  a  sample  sise 
of  n  ■  25. 


I  4 
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1.2,  V  -  1.?, 


6.3,  7y  =  3.6,  -.4,  ?  -  -.05 

S(Y)  +  CfM  -  SCX)]  ,  SCUY)  -  200  +  c2£y  -  S(Y>] 


The  orientation  of  the  axes  is  shorn  in  figure  A. 7 


Estimated  Orientation 
of  the  Axes 


Figure  A. 7 


It  should  be  noted  that  the  orientation  of  the  axes  in  figure  A.7 
Implies  that  the  random  variables  X  and  Y  arc  nearly  independent  and 
that  the  independent  model  of  computing  the  CEP  can  be  used  with  only 
a  small  error  due  to  the  orientation.  The  computed  values  for  the  two 
different  estimates  of  the  CEP  are  CEP2-  3.28  and  '£sp£-  3.23 
A.4,2  Illustration  (2) 

The  data  obtained  from  problem  3  with  a  sample  size  of  n  »  15. 
x  -  -.5,  y  -  -.2,  %  -  10.7,  ^  15.4,  11.6,  P  -  .;0 

B COX)  •  ECY)  ♦  Cf  X-SU)]  ,  E(XIY)  •  S  f  Cj  [  Y-S(Y)3 
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VAR (X) 

6  -  47° 16* 


1.082  «  land 


C2  =»  C0V(;:,Y)  -  .755  -  Tan0 
VAR(Y) 

0  »  37°3* 


The  orientation  of  the  axes  is  shown  in  € i jtu re  A.S 


Estimated  Orientation  of  the  Axes 
wien  Dependence  is  Implied 

Figure  A, 3 


It  should  be  noted  that  if  this  were  the  true  orientation,  it 
implies  almost  perfect  correlation  between  the  random  variables  X  and 
Y.  This  orientation  will  exhibit  the  greatest  difference  in  the  esti> 
mates  of  the  CEP  if  independence  vere  initially  assumed.  The  computed 
values  for  the  two  different  estimates  of  the  CSP  are  CEP2*  3.52  and 
CEPJ  -  3.72.  • 

A.4.3  Illustration  (3) 

The  data  for  this  illustration  is  also  obtained  from  problem  3  with 
a  sample  size  of  n  •  15,  owevex,  in  this  case*  the  two  outliers  have 
been  removed  and  the  sample  size  used  for  the  computation  is  13, 
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x  *  -.8, 

j  *  .4s,  3.64,  Vy  ■  4.8 

-A  A 

Ty-  -32,  f 

-  .07 

_ — . 

s(Y/:0  - 

3(Y)  +  CjC”-200l 

EC'.IY)  -  ECO 

+  c2Cy-e(y; 

where 

C1 

=  9xy  a  .033  *■  T and 

c2  "  -Sx 

_  =  ,066  ■ 

%x 

3  «  5°3* 


0  -  3°4o* 


rhe  orientation  of  the  axes  after  removal  of  the  outliers  is  shown  in 


figure  A. 9 


|(X«y) 

- X  ax/j 


I 

Estimated  Orientation  of  the  Axes 
After  Removal  of  the  Outliers 


Figure  A, 9 


It  should  be  noted  that  the  removal  of  the  outliers  rotated  the 
axes  enough  so  that  independence  oould  be  assumed  with  only  a  small  error 
in  the  estimate  of  the  CEP.  The  computed  values  for  the  two  different 
estimates  of  the  CEP  are  2. 42  and  6ei)£  «*  2.52.  Thus  the  removal 

of  outliers  will  not  only  reduce  the  size  of  the  CEP  but  may  aid  in  the 
determination  of  whether  the  simpler  model  of  independent  estimates 

‘a 

may  be  used  or  not. 


A. 5  Transformation  of  the  Axes 

In  order  to  integrate  tre r.i.u. ;  (\«1)  of  Section  1,  it  is  necessary 
to  transform  the  axes.  This  trcosccr-.r^ujon  can  be  done  in  several  ways 
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but  the  use  of  matrix  notation  can  greatly  simplify  the  procedure. 

It  is  necessary  to  first  define  some  of  the  concepts  which  will  be  used. 


A. 5.1  Definitions: 

A, 5. 1,1  The  matrix  A=>(a^ ) 
simplification. 


where 


A.5.1.2 


The  transposed  matrix  is  defined  as  A* 


A.5. 1.2,1  Theorem  1.  The  transpose  of  A,«»(A,)'*A 
A.5,1.3  The  inverse  of  A  is  defined  as  the  matrix  A“*  such  that 
AA-1  =*/l  0\ 

lo  1/ 

A.5.1.4  The  identity  matrix  I  *=/ 1  o\ 

lO  1' 

A.5.1.5  A  symmetric  matrix  is  defined  as  a  matrix  such  that  the 
transpose  of  the  matrix  A  equals  A.  That  is 


A.5.1.6  If  C  is  a  2x2  matrix  such  that  C*C  ■  I,  then  C  is  defined 
as  an  orthogonal  matrix  and  C'*C**. 

A.5.1.7  A  characteristic  root  of  a  2x2  matrix  A  is  a  scalar  X  such 
that  AX»  A  X  and  AX«  AX-0  for  some  vector  X  t  0;  It  follows 
that  if  A  is  a  characteristic  root  of  A,  then  (A-Al)X»0 
and  therefore  ^A-AI|“  0. 
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;i 

:  4r 


it 


*  I 


UC  ?'  c-  :!iasc-nai  matrix  D  is  defined  as  a  square  matrix 
’•’••use  off  elccoisai  elements  arc  all  zero,  That  is., 
j"  the  if  i*j,  In  tins  example 


^0 


0  \ 


22' 


■'-f.I.A  The  quadratic  fom:  Q  is  defined  as  >2»AZ 
A. 5.2  vi ie  bivariate  normal  density  function  in  matrix  notation  is 


2  7T  /A-/  .  ; 


where  A  ~ 


7iTy 

Tr 


) 


A.l„/v  fK7y\ 
(  p£?y  v// 


It  should  ho  noted  that  A  and  A**  are  both  symmetric  matrices.  A  .\t 
is  A*  =  A*  and  A"*«(A*1)**  Thus  the  theorem  applies  that  Cor  every  sym¬ 
metric  matrix  A”’  there  exists  on  orthogonal  matrix  C  s.ich  that  C‘ .A'  •  h~h 
where  D  is  a  diagonal  matrix  whose  diagonal  elements  arc*  the  chance  er¬ 
istic  roots  of  A"1.  The  matrix  D  would  thus  be 

X,  o 


(  '  ) 
u 


where  A,  and  A*  arc  the  character i  s::  -  c 
roots  of  the  matrix  A. 

In  order  to  find  the  characteristic  roots  c£  A''*  ».rc  must  first  use  the 
identity  matrix  I  »^1  0 y  then  IA*^!  OjA  -  ^A  0 j 

Th.e  characteristic  roots  of  a  syarr.etr i c  matrix  are  determined  from  the 
characteristic  polynomial  f 'A) *» / A~ ' -  Al/’-O 

l/K  1  Ky\ 

I  Uy  Tyl  * 


'A  o 
o  A 
* 


-  O  ' 


Txl-X 

rxy 

V,y 

77"- A 

--  o 


(^-  A)(  Vyl-A)  -  ^ 

a*-  ^ylAA  *•  ^  =  °  Best  Available  Cop' 
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3is  i  s  r.:*.  1  v  i  ’  . air«  :•  the  ooncral  fort: 
o 

ay/-  !•  :•  c  =  v 


quadratic  equation. 


sl tHrxi t vr  -  y 

5L 

tS  +  ts  i-Jto'-n't  tJJEl  -  r1- 

Y  H 


K--* 

v  --  cA 


<-  v/  -  ./  05 1  -  gvj  A  *  v  gd _ ; 

3t 

o  x  ;  /<r«fc  o  \  = 

A*.  i  I;*)  ^  / 


M  I  ^ 

V£)  Ai. 


/  /  tr*3-  O 

’(*■  )  Wo  -t 


'Che  transformed  density  function  is 

o.i:)  cj _ J  -  :.-=rr^ 


cJu,v 


._J _  evl  w,v,’V 

a*r  (A*"‘//t' 


•Jo  not.’ Ijnvc  a  normal  bivariate  density  function  with  independent  v::,i  ’ 
variables  U  and  V  such  that  the  matrix  ..  /  \is  distributee!  i*v.  >-*  / 


and  U  »  C'Z  where  C  is  the  orthogonal  matrix  defined  which  satisfies 
the  relationship  C'A“1"C  *  A*'".  Chore  to  re  t*  C  is  also  '-exu.i-  u-.. 


Cl  i  ore  fo 

1'  vi 

Cf  z 

is 

a  1 so  d i s 

tri’v . 

invclv: 

p  f  f 

1 1  C’ 

*'  he 

co  r 

relation 

constant 

0  *  a 

for 

du 

s  distri 

but ion.  It 

iJ— 

){^ 

o 

vn 

■« 

ho  oricn 

tat  ion  of  t 

1° 

V 

A"i 

i*.  ;*  •  ' 

on-: 

t  .  - 

:  i  « l 

r,b. 

outi  in  ' 

i;;uvc  -.,11. 

Best  Available  Copy 


Ellipse  Forced  by  Cutting  the  livarinte 
hor:.;al  Density  Functionate,  v)  by  a  Plane 
Parallel  to  the  u,v  Axes. 

. .  Figure  A.  1 

Fortunately  it  is  not  necessary  to  compute  the  orthogonal  •..■atrix  J 

which  satisfies  the  relationships  above  since  the  characteristics  cf  the 

orthogonal  ratrix  J  requires  that  C'C  =  I  and  C*  °  C"1.  Then 

(A.  14)  (VJ  *A*(’?  -  (CZ  ’  A*(C*  Z  -(2  »CA*C»(Z  -(2  ’A(Z 

but  C*  A" 1 C“A*~ 1 

there  f o  re  ( C» A" 1 C) “ 1 « ( A*" 1 ) - 1 -A* 

“C”1  AOA* 

CC”lACC"1«CA*£Tl 


Therefore,  it  can  also  be  shown  that  the  corresponding  areas  under  the 
density  functions  are  equal.  That  is 


(A.  15)  /jfy-  ,.Cu,v)du'lv»  „(x,y)dxdy 


whore:  Cj  (u , v) 


27T  j  A*-  Ll  j 


i:-:p  'VJ,A’'  ! 


f..  ..(x.v)  «=  t  e>:n  ~'-Z'  AZ 

21  | A" if  T 


Aden  ©iqeiiEAV  iseg 
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This  is  because  VJ* A*'.’  =  Z'AZ  as  shown  above  and  J A*"1[»|A”1/  .  It  is 

stoun  above  that  C“lA"IC  =  A*”1  and  the  determinates  of  the  two  terms,, 

are  /C“1ACf*> /A*“1/  =  |C-1||A"1/  / Cf^jA*"1/  “/A*1/  Id-/ A-1/ 

I  C I 


There -ore 


,,!u,v)d«Jv=jJ E.,  v(x,y)d«dy 


AdoQ  0|qB|reAV  iseg 


APPENDIX  3 


REMARKS  ON  ESTIMATION 


3.1  Introduction 

The  theory  of  estimation  is  concerned  with  the  problem  of  finding 
functions  of  the  observations  such  that  the  distribution  of  these 
functions  will  be  concentrated  as  closely  as  possible  near  the  true 
values  of  the  parameters  estimated.  The  density 'function  of  the  obser¬ 
vations  under  consideration  was  described  in  Section  I  and  the  para¬ 
meters  which  are  to  be  estimated  are  ux>u  »  ;Tv>  V.  ** nd  • 

^  y  v  y 

Some  of  the  properties  which  are  desired  of  the  estimators  were 
described  in  Section  1.3. 

3.2  Maximum  Likelihood  Estimation 

If  .  (XpXj. • . .  •  .xn>y i »V2« . . . » *yn»ux»u. ,*  V't  C  )  is  the 

density  function  for  a  random  simple  of  size  n  with  unknown  parameters 
ux»uy»  V-v,  and  C  ,  then  the  likelihood  function  is 


/J 


(3.n  l  "T if t) 

-  ff 

*TT  Vx  Vy  ji-pr  '  .  '■  " 


Since  it  is  more  convenient  to  deal  with  sums  than  products,  it 
is  easier  to  maximize  the  logarithm  of  the  likelihood  function  rather 
than  the  likelihood  function  itself.  It  should  be  noted  that  the 
logarithm  has  its  maximum  at  the  same  point  as  does  the  likelihood 
function.  The  log  of  (3.1)  is 

(3.2)  L‘  *  -v/oaZW-f/o<,vf~f  Vf  - 

v/  V 

J! 

3^-e5)  /  LV  %  J  •  V.  /{  7  )  1  [  V-y  )} 


j.zi 
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Best  Available  Copy 


The  maximum  likelihood  estimate  of  each  of  the  unknown  parameters 
is  obtained  by  setting  the  derivative  of  the  function  with  respect  to 
each  of  the  unknow,  parameters  equal  to  zero  and  then  solving  the 


resulting  equations  simultaneously.  To  illustrate  this  procedure,  the 
assumptions  will  be  made  that^=7y®  V  andf"  0.  For  this  special 


case,  formula  (B.2)  becomes 


•M-  „  a 

(B.3)  L»  -  -N  log  27T  -N  logV*  ‘ 


and  the  partial  derivatives  are 


Equating  the  partial  derivatives  to  zero  and  solving  simultaneously. 


it  follows  that, 


<B.A)  U* 


(B.5)  Uy 
CB.6)  T* 


Since  maximum  likelihood  estimators  are  in  general  biased  eati- 
%*  •  .• 

nators,  it  is  necessary  to  examine  them  to  see  whether  they  are  unbiased. 

/> 

Fbr  example,  if  the  expected  value  of  the  estimator  0is  equal  to  B,  & 
where  Q  la  the  true  parameter,  then 
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(B.7) 


and 


-o 

0 

6, 


is  an  unbiased  estimator. 


Since  £  (xj)  »  ux  and  ECy^  ■  for  i  -  1 . ».n,  it  follows 

that  "x  and  y  are  unbiased  estimators  for  ux  and  respectively. 

The  expected  value  of  the  estimator  in  formula  (B.6)  is  obtained 
by  recognizing  the  fact  that  there  are  2(n-l)  independent  squares  in 
the  sum  and  therefore  2n  «»  a  chi  squared  random  variable  with 

2(n-l)  degrees  of  freedom  as  defined  in  formula  (4.5).  Since  the  ex¬ 
pected  value  of  a  chi  squared  random  variable  is  equal  to  its  degrees 
of  freedom,  it  follows  that 

(B.8)  E  (**$*-)  •  2(n-l). 


Therefore, 

(b.9) 


AH  IT*  .  P  v3 
N 


and 


r, 


,  Tlftfa-x' 7 1 1 1 


s  an 


unbiased  estimator  of  V* when  the  variances  are  equal.  When  the  vari« 
anecs  are  not  equal,  the  same  procedure  may  be  used  and  the  unbiased 
estimators  of  ^  and  Yy  are 


(B. 10) 


rr»  7  (Xa-x)* 

T*  "  - 

f*  .  fcn-9 f  9 

a/-/  • 


It  should  be  noted  that  the  estimators  (B.10)  are  used  in  Model  II. 
Also,  if  the  assumption  is  made  that  the  true  values  of  the  means  ara 
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zero,  t.hen  the  estimators  for  the  variances  in  Model  I  are  also  unbiased* 
The  estimators  of  the  means  and  variance  used  in  Model  I  and  Model  II 
are,  apart  from  the  biasing  factors,  maximum  likelihood  estimators. 
However,  it  should  be  noted  that  the  CEP  is  a  function  of  the 
standard  deviation  and  not  the  variance.  The  following  section  will 
determine  unbiased  estimators  for  the  CEP  using  the  procedure  in  this 
section, 

3.3  Unbiased  "Maximum  Likelihood"  Estimate  Of  The  CEP:  When 

Vy  =  Vy  m  V  and  ("'  »  0. 


The  maximum  likelihood  functional 'T’v.-hen  u  *  u  «*  0  is 

*  y 


(3.11)  VT  -  /  577-2  + 


_  «N 

fince  the  sum  in  (3.11)  divided  by  V  has  a  chi  squared  distribution, 
it  follows  that  the  square  root  of  a  chi  squared  random  variable  di¬ 
vided  by  its  degrees  of  freedom  has  a  chi  distribution.  The  density 
function  of  a  chi  distributed  random  variable  with  2n  degrees  of  free¬ 
dom  i  s  , 


( B. 12)  fu(u)  - 


u _ 

r^i 


u  ?  0 


-  0  u  4  0 

where  /  (n)  is  the  gamma  function  with  parameter  n. 

Then, 


1 


r.(N*j)  r 

P  (A/)  fN 


and 


''N 


(3.13)  E(W) 


A 

vr 


rp*i)  !  ^  & 


an 


unbiased  estimate  of  V~  and  therefore 


(3.14)  CEP**  »  1.1774 is  an  unbiased  estimate  of  the  CEP. 

The  maximum  likelihood  estimator  of  V  when  the  means  are  not 
zero  is 

(3.15)  v,.  -  Ug  fiou-xf 'm-vf  ] 


Therefore, 


and 


(B.17) 


|  .  QtnEL  . 

1  n*-i)  Jm 


an  unbiased  estimate  of  V~  .  Therefore 


CB. 18)  CEPi*  -  1,1774  is  an  unbiased  estimate,  of  the  CEP. 


The  reader  may  be  Interested  in  the  magnitudes  of  the  biasing 
factors  and  a  comparison  of  the  biased  and  unbiased  estimators  of  the 
CEP.  The  results  obtained  using  the  data  from  the  sample  problems  are 
presented  in  Tables  i  and  j. 


UO 


Table  1 


Comparison  of  the  Biasing  Factors  of 

the  Two  Estimators 

i 

Case 

CEP** 

CEP** 

1 

i 

j 

1 

3, ,  -/lO  /7l0)  -  1.01 

7TTo75T 

11 

; 

b21  JfaQa.  -  1.0, 

j 

i  2 

i 

B- " 1,01 

1,04 

1  3 

s 

-%9p-  1,005 ! 

1,05 

j  Table  j 

Comparison  of  the  Estimators  with  the  Methods  Used  In  Sections  II  and  III 


i.  Problem 

Appendix  3 

Section  II 

Section  III 

CEP** 

1 

CEP 

1 

CEP* 

1  , 

T©** 

2 

- - 

CEP 

2 

CEP* 

1 

Case 

i 

1  1 

! 

4.20 

3.97 

4.13 

3.86 

3.64 

3.72 

Case 

2 

4.13 

3.84 

4.10 

3.95 

3.87 

3.88 

Case 

3 

3.53 

3.48 

3.5i  ; 

I 

t 

3.29 

3.28 

3.26 

I2 

!  Case 

l 

3.55 

3.37 

3.50 

3.47 

3.33 

3.55 

Case 

2 

3.50 

3.45 

3.51 

3.48 

3.39 

3.52 

Case 

1 

3 

3.83 

3.77 

3.78 

3.78 

3.71 

i 

3.74 

is 

Case 

1  I 

4.76 

4.17 

4.69 

4.76 

4.66 

5.02 

Case 

2 

4.10 

3.65 

4.03 

3.52 

3.56 

4.21 

Case 

3 

3.69 

3.36 

•3.65 

3.7S 

3.52 

3.72 

B.4  Comparison  Of  The  Two  Estimates!  Relative  Efficiency 

Throughout  this  section  it  is  assumed  that  Vi  "Ty  »  7"  and  f  •  0. 
It  can  be  proven  that  CEP**  lias  greater  efficiency  than  any  other 
unbiased  linear  sample  statistic  rhen  the  mean  value  is  (0»0).  In  case 


III 


the  moan  is  not  zero  but  is  known  to  be  small,  this  estimate  should  be 


considered.  Ci2?|*  is  asymptotically  efficient  whatever  the  population 

mean  may  be,  hence,  if  the  mean  is  2rcatly  different  from  (0,0),  CUP** 

will  be  a  better  estimate  than  CEP^*.  dowever,  because  2  degrees  of 

freedom  are  lost  in  estimating  the  coordinates  of  the  mean,  the  estimate 

CEP&*  will  not  be  as  precise  as  CEP**  for  small  values  of  (u  ,u  ). 

*  1  x  y 

In  order  to  determine  whether  to  use  CEF**  or  CEPjy*  when  it  is 
known  that  the  true  mean  is  close  to  (0,0),  it  is  necessary  to  compare 
the  two  estimates  by  some  criterion.  The  method  which  will  be  used  is 


the  ratio  of  the  relative  efficiencies.  When  CEP**  and  CEP**  are  used. 

1  2 


the  formula  is 
(3.19)  - 


r  g-rfl  .  E [(§-?)*-] 

EL(t-rn 


-cepTI 


This  coqparison  may  be  done  by  assuming  that  the  true  mean  is 

either  some  point  (u  tu  )  or  (0,Q).  In  the  case  that  the  assumption  is 

x  y 

made  that  the  true  mean  is  (u  ,u  )  the  joint  density  function  is 

X  y 

(0U0)  f (*,;•;  vy'r>  '  Tfhp  Jj 


'.Jheri  it  is  assumed  that  the  true  mean  is  0,0),  the  joint  density 
function  Is 


(B.21)  S(x.y[  o.o.r  )  - 

The  development  of  the  ratio  assuming  that  the-  true  mean  is  (0,0)  follows 
the  procedure  applied  in  formula  (5,13),  The  result  is 

/•>;/>'/)  1 7* 

r  •  j  ,  .  I  I 


o.22)  r)l  - 


11? 


fafo-n 

r^-i) 


(3.2»  E[(f  —  r)A]= 


r 


Combining  formulas  (3,22)  and  B,23)» 

/'Xn)f(A>*n 


(8.24)  a.?.  - 


CL^j) 

r  *  (v-  i) 


the  ratio  function  is 
-/ 


VJhen  the  mean  is  (0,0),  the  ratio  function  in  (3,24)  is  les.  than 
1  for  all  n.  Table  k  presents  values  of  the  ratio  function  for 
n  -  2(1)20,  25(5)50,  P.B,  ibranda  tables  this  ratio  for  n  »  2(1)3, 


If  it  is  know  that  th«  true  mean  is  at  soma  point  (u  ,u  )  than 

X  / 

formula  (3*20)  is  the  Joint  density  function  of  ths  component  errors* 
Tits  U*F»  ratio  for  this  case  vjs  dovulopec  by  1\3.  Jbranda  in  rsferencs 
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(3),  In  order  to  find  the  mean  square  deviation  of  CEP**,  the  same 

procedure  can  be  followed  as  in  formula  (3,13)  and  the  result  is  the 

same  ns  formula  (3,22),  The  mean  square  error  of  CEP**  is  a  function 

of  u  and  u  ,  Ibranda  assumed  for  ease  of  computation  that  u  ■  k  T 
x  y  X  1 

and  Uy  -  V'  , 


Lotting  u  be  defined  by 


(3,25)  u  - 


>(X-/..^) 

‘  1  7"  * 


)  u  has  a  non- central  chi 


squared  distribution.  Values  of  K.F.  shorn  in  Table  1  (an  -  erpt  from 
Table  (!)  in  reference  (3))  were  obtained  by  putting  U  -  k^,  and  vary¬ 
ing  k  from  0(, 1)1,0,  The  results  of  this  derivation  show  that  as  n 
Increases,  the  ratio  function  decreases  for  a  constant  value  of  !:.  It 
can  be  ascertained  from  this  table  that  for  large  n,  CEPJ*  will  be  the 
boat  estimate  unless  k  equals  zero  and  CEP**  will  be  best  for  small  n 
and  small  values  of  k.  The  practical  use  of  the  ratio  under  these 
••sumptions  require  the  use  of  estimates  to  obtain  the  values  of 
•nd  kj  and  although  not  txact,  may  still  supply  some  useful  information. 


Table  1 

(B  (CEPl  - 

cap)2 

E 

Ccep2  - 

CSP)2  ) 

0.0 

0,1 

0.2 

0.3 

0,4 

0.5 

0.6  0.7 

0.8  0.9 

1.0 

* 

.481 

.487 

,503 

.530 

.575 

.639 

.723  .549 

1.01  1.21 

1.45 

3 

,656 

,663 

.685 

.727 

.796 

.903 

1.06  1.28 

4 

,743 

.751 

.777 

.829 

.919 

1.06 

1.28 

* 

,796 

,604 

,834 

.893 

1.00 

1.18 

i 

6 

.830 

,838 

,069 

,937 

1.06 

1.23 

7 

.824 

,364 

,896 

.972 

1.12 

fl 

,073 

.884 

.917 

1,00 

1.16 

114 


A  possible  procedure  for  using  IVblc-  1  i o  as  fellows: 

*  /\  ^ 

."■-.e  valuer,  of  it  and  y  arc  first  confuted,  Then  Vi  and  VL  are  com¬ 
puted  using,  formulas  (P-.ll)  anc!  (3.15)  respectively.  The  estimated 


value  of  and  1<^  will  then  equal 


(P.2?)  k 


m  v 

1  V 


''2  m2 
2  f 


where  V  — 


-  EjLS 
a 


'Jains  k.  +  and  a,  an  analysis  of  table  1  ..ay  show  when  C21** 

i.  •  *  * 


k« 


in  not  the  best  estimate.  The  reader  should  be  cautioned  that  no 
attempt  has  been  made  to  theoretically  justify  this  procedure* 

In  order  to  better  illustrate  the  above,  the  competed  values  from 
the  example  problems  for  case  1  are  shot/n  in  table  m. 


P"  To  hie  ra 

.  _ _  _ _ _ Confuted  Values  of  k . 


Problem 

<i:-io) 

X 

7 

hodol  I 

:x>ddf  hT  y 

Kl 

“2  | 

f, 

% . ]  -'l 

3.2'  "j  3.3 

t 

! 

1 

1*2 

2.0 

3.5 

.364 

.605  | 

.434 

2 

1.0 

.1 

3.0 

3.0  ;  3.0 

.333 

.033  I 

t 

.133 

3 

i 

.6 

.3 

4.1 

4.2  !  4.1 

.146 

.073  | 

.10? 

.  .. 

Analysis  of  Table  m  nsing  the  Above  Values 


1  Cor  1<  ■“  ,4b A,  It.r.  >  1  for  all  n  >',  therefore  CEP|*  is  best 
estimate, 

2  for  k  »  .20,  *  ,?17  <  l,  for  n  •  3,  therefore  C2P**  is 

slightly  better, 

3  for  i;  -  .100,  X.7.  -  .534  <  1  for  a  -  3.  C5T**  is  better. 


i  1 5 


3es *  Available  Copy 


appe::di::  c 


c.?  ::h  siv/uiats  gauesiai:  eistsisutiok 


C.  ]  Introduction 

This  appendix  discusses  the  details  of  the  integration  introduced 
in  Ceccions  II  and  IV,  The  integration  of  an  ellipse  or  a  circle  oven 
the  bivariate  normal  density  function  may  be  simplified  by  making  the 
transformation  explained  in  Appendix  A,  That  is  from  equation  (A.15) 

^•1)  !  j  <.(x,y)dxdy  “  f  {  Z-  w(u»v)dudv  where 


Z.:  „0-!,v)  = 


1 _  e*P  ’  I  /  u2  +  V2  \] 

^  ^  l  ~  {  Vit  V '}  . 


C.2  Integration  Over  Circle 

The  probability  that  a  random  point  (1,7)  will  lie  within  a  circle 
with  center  at  the  origin  and  radius  kTxis  written  as 


(C.2)  P(  |/u2  +  V2  -f  UZ,  )  <,  ffzCtV (u, 

fuUV'i  kV* 


v)dudv  . 


The  two  illustrations  in  figure  C.l  show  the  geometric  area  of 


integration. 


6  "Circular  Error  Probabilities"  by  H.  Leon  Harter  /4/  of  Aeronau¬ 
tical  Research  Laboratories. 


3ost  Available  Copy 
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Integration  of  the  3ivariate  Density  Function  Over  a  Circular  Region 

Figure  C.l 


In  order  to  simplify  equations  (C.l)  and  (C.2)  let 


U  V 

(C.3)  —  «  n-.cosQ,  — -  »  msin», 

v«  V* 


then 


/fat  <- 


where 


1  = 


,)IM  i© 


now  let  C  -  S'  ,S/A/le  t- cos'e 

or  k 


and  the  probability  is 


P(K.t)  --  COSl0<-l]j 

mi 

let  col's  =  £  (it COWS),  0 s a®i  2  *  yp,  and  the  probability  la 


Afc* 


/>(*,<:)  d  ff-  ffe*P(-Z  [{£*«•/)  +  (C-l)  COf  0]}  dl  Jt 
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•cn  initial;  --it!:  i.  .',p.*ct  to  Z  and  the  probability  becomes 


!  n  ril 


/Y*,c) 


ac_  /  /-  ev[- ^c^co^jj  ,, 

^  1  (cxt!)  +(Cx-l)COS<t 


This  Torn  is  integrated  using  the  trapozoidal  rule  and  utilizing 
computers  to  do  the  integrating. 

For  example,  the  curve  below  represents  some  function  that  we  v?ish 
to  integrate  over  the  designated  interval.  We  can  divide  the  interval 
into  equal  sub  intervals  (d$)  and  sum  all  of  the  sub  intervals.  As  the 
sub  intervals  become  smaller,  the  accuracy  of  this  type  of  integration 
l o cones  better  and  this  summation  technique  approaches  the  actual  area 
under  the  curve. 


Trapezoidal  Technique  of  Integrating  Under  a  Curve  by  ^.nation 

Figure  d.2 


.lie  formulation  for  integration  with  n  intervale  t*  ut*  becomes 


(..O  Pc;:,  .-)  - 


m  >  i-»i&2/4:2)  C(;2u)  ♦  tc>.i)  ±js?j. 

TT  21  +  (0-1)  X.  ” 

|H:o 


•dicre  d  »  JT 
n 
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hi  ccv.?'.iU  r  cm.  i'l ",  the  w  ,11. C  fu.ik.ticn  is  usually  changed  to 
the  '.chebysehorf*  polyno;  -.ial  or  some  other  scheme  that  converges  more 
rapidly  than  the  o-hilhC  I unction.  This  can  be  shown  by  comparing  the 
convergence  or  the  tv:c  methods.  If  vie  let  COGS  «  x  ■  T  (x) ,  then 
Xra(x)  «  C031B  «  7a(CO&) 


The  formula  for  integration  now  becomes 


(C.7)  f  (  A , C  )  = 


7T 


to-o 


This  summation 


is  now  made  for  different  values  of  k,c,  and  F(k,c) . 


C.3  Integrating  Over  an  Ellipse 

The  probability  that  a  random  point  (b,V)  will  lie  within  an  ellipse 
with  center  at  the  origin  is  written  as 


(C.3) 


i  f  jj  ;  v?<k| 

L  Ah  V  J 


1  f  (  exp^-h  (  n2  +  v2  )l 

TTV*V*  J  )  L  KT*  tr'J 

ii>h*  * 


dudv 


The  two  illustrations  In  figure  C.3  show  the  geometric  area  of  integration 


Volume  of  Integration  of  ellipse  .'or.  c  l  b*  Flano  Cutting 

i  variate  density  .Amotion  g.^v(u,v)  Farsi  lei  to  u,v  Plane 

figure  C.3 
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Note  that.  it  the  variances  are  equal,  this  form  is  also  circular, 
the  three  dimensional  form  being  a  perfect  bell  and  the  two  dimensional 
fori;:  being  a  circle. 

In  order  to  further  simplify  this  form  let 

( C, 9)  u  =  7^00 29,  v  *»  >+1  VJ>  SINS,  then  the  probability  becomes 

P(k/I,9)  -  1  (f  exp  I  T I  JiH  where 

2  7 U  ) 

1*<K 


thus, 

K 

exp  j-lj  M  </  Q  f 

O 

formula  0,10  car.  be  integrated  directly  by  first  integrating  with 
respect  to  3  and  then  vita  respect  to  m,‘  After  integrating  with  respect 
to  t,  the  formula  becomes 

(£*n)  10;,  .)  -  f 

o 


Cc. lo) 


r(hAs)  »  f 

:>7 r  J 


If  wo  let  t“...",  the  probability  statement  becomes 

kx 

(d.12)  I‘(!v,t)  »  ’i  fexp  -'.t  dt  where  f  ,(t)  is  the  chi  squared  density 

o  4 

function  with  tu>  degrees  of  freedom  as  defined  in  formula  (4.5) 

Aiat  is 

( 

for  any  value  of  P(k,t)  the  value  of  can  be  obtained  frost  table  4 

by  entering  with  i (k,t)  and  2  degrees  of  freedom, 

(  4 
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-ms)  nT*ri -?[]£+$<*'  ] 


/-  e  v 


» 


i 


